(ix) TELECOMMUNICATION INFORMATION:
(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
TCGAGTCGGA GAGTTGGCAC AGGCCTTGAG CTCGCTGTGA CGTTCTCACG GTGTTGGSTG
GGATCAGCTG GTGACTCAGA CAAGTCTTGA GCTCTACAAC GTAACATACG GGCTGATGC 
CACCCGATAC CAGAATTACG CAGTCGGCAA TTCTGTGCCC TAGAGTCACC TCAAAGAATA 
ATCTGTGGTG TCCAAGGGGA GGGTTCTGGG GCCGGCTACT TAGAAACCGC CATAGATCGG 
GCAGGGTGGA GTACTTGAGG AGCCGGCGGT AGGTGGCCAG GTGGGCCCGG TTACCTGCTC 300

TTTTGCGTGC TGOTGGAAGC CTGCTCAGGG ATTTCTTAAC CTCGGCCTCG GTTGGACGTA 360

CCATGGCAGA AGGCOGTTTT GGAGCGGACT CGGTGGGGCG CGGCGGAGAA AAGGCCTCTG 420

TGACTAGGGG AGGCAgVtGG GACTTGGGGA GCTCGGACGA CGAATCAAGC ACCTCCACAA 480 

CCAGCACGGA TATGGACG^C CTCCCTGAGG AGAGGAAACC ACTAACGGGA AAGTCTGTAA 540

AAACCTCGTA CATATACGAC GTGCCCACCG TCCCGACCAG CAAGCCGTGG CATTTAATGC 600

ACGACAACTC CCTCTACGCA A^GCCTAGGT TTCCGCCCAG ACCTCTCATA CGGCACCCTT 660

CCGAAAAAGG CAGCATTTTT GCOK3TCGGT TGTCAGCGAC TGACGACGAC TCGGGAGACT 720

ACGCGCCAAT GGATCGCTTC GCCTTtCAGA GCCCCAGGGT GTGTGGTCGC CCTCCCCTTC 780 

CGCCTCCAAA TCACCCACCT CCGGCAHCTA GGCCGGCAGA CGCGTCAATG GGGGACGTGG 840

GCTGGGCGGA TCTGCAGGGA CTCAAGAGGA CCCCAAAGGG ATTTTTAAAA ACATCTACCA 900 

AGGGGGGCAG TCTCAAAGCC CGTGGACGCgNaTGTAGGTGA CCGTCTCAGG GACGGCGGCT 960 

TTGCCTTTAG TCCTAGGGGC GTGAAATCTG OCATAGGGCA AAACATTAAA TCATGGTTGG 1020 

GGATCGGAGA ATCATCGGCG ACTGCTGTCC CCOTCACCAC GCAGCTTATG GTACCGGTGC 1080 

ACCTCATTAG AACGCCTGTG ACCGTGGACT ACAGGAATGT TTATTTGCTT TACTTAGAGG 1140

GGGTAATGGG TGTGGGCAAA TCAACGCTGG TCAACGOCGT GTGCGGGATC TTGCCCCAGG 1200

AGAGAGTGAC AAGTTTTCCC GAGCCCATGG TGTACTGg\c GAGGGCATTT ACAGATTGTT 1260 

ACAAGGAAAT TTCCCACCTG ATGAAGTCTG GTAAGGCGGgVaGACCCGCTG ACGTCTGCCA 1320 

AAATATACTC ATGCCAAAAC AAGTTTTCGC TCCCCTTCCG GACGAACGCC ACCGCTATCC 1380 

TGCGAATGAT GCAGCCCTGG AACGTTGGGG GTGGGTCTGG GAGGGGCACT CACTGGTGCG 1440

TCTTTGATAG GCATCTCCTC TCCCCAGCAG TGGTGTTCCC TCTOkTGCAC CTGAAGCACG 1500

GCCGCCTATC TTTTGATCAC TTCTTTCAAT TACTTTCCAT CTTTAQAGCC ACAGAAGGCG 1560 

ACGTGGTCGC CATTCTCACC CTCTCCAGCG CCGAGTCGTT GCGGCGG3TC AGGGCGAGGG 1620 

GAAGAAAGAA CGACGGGACG GTGGAGCAAA ACTACATCAG AGAATTGGCk TGGGCTTATC 1680 

ACGCCGTGTA CTGTTCATGG ATCATGTTGC AGTACATCAC TGTGGAGCAG YtGGTACAAC 1740

TATGCGTACA AACCACAAAT ATTCCGGAAA TCTGCTTCCG CAGCGTGCGC CTGGCACACA 1800

AGGAGGAAAC TTTGAAAAAC CTTCACGAGC AGAGCATGCT ACCTATGATC ACCOGTGTAC 1860 

TGGATCCCGT GAGACATCAT CCCGTCGTGA TCGAGCTTTG CTTTTGTTTC TTCAOK3AGC 1920 

TGAGAAAATT ACAATTTATC GTAGCCGACG CGGATAAGTT CCACGACGAC GTATGCG^CC 1980 

TGTGGACCGA AATCTACAGG CAGATCCTGT CCAATCCGGC TATTAAACCC AGGGCCATOA 2040
ACTGGCCAGC ATTAGAGAGC CAGTCTAAAG CAGTTAATCA CCTAGAGGAG ACATGCAGGgX 2100
TCTAGCCTTC TTGGCGGCCC TTGCATGCTG GCGATGCATA TCGTTGACAT GTGGAGCCAC 2160

TGGCGCGTTG CCGACAACGG CGACGACAAT AACCCGCTCC GCCACGCAGC TCATCAATGG 2220

GAGAACCAAC CTCTCCATAG AACTGGAATT CAACGGCACT AGTTTTTTTC TAAATTGGCA 2280
AAATCTGTTG AATGTCATCA CGGAGCCGGC CCTGACAGAG TTGTGGACCT CCGCCGAAGT 2340

CGCCGAGGAC CTCAOGGTAA CTCTGAAAAA GAGGCAAAGT CTTTTTTTCC CCAACAAGAC 2400

AGTTGTGATC TCTGGA&ACG GCCATCGCTA TACGTGCGAG GTGCCGACGT CGTCGCAAAC 2460 

TTATAACATC ACCAAGGGCT TTAACTATAG CGCTCTGCCC GGGCACCTTG GCGGATTTGG 2520

GATCAACGCG CGTCTGGTACS TGGGTGATAT CTTCGCATCA AAATGGTCGC TATTCGCGAG 2580

GGACACCCCA GAGTATCGGG VgTTTTACCC AATGAATGTC ATGGCCGTCA AGTTTTCCAT 2640

ATCCATTGGC AACAACGAGT COGGCGTAGC GCTCTATGGA GTGGTGTCGG AAGATTTCGT 2700

GGTCGTCACG CTCCACAACA GGtYcAAAGA GGCTAACGAG ACGGCGTCCC ATCTTCTGTT 2760 

CGGTCTCCCG GATTCACTGC CATCtCTGAA GGGCCATGCC ACCTATGATG AACTCACGTT 2820

CGCCCGAAAC GCAAAATATG CGCTAGTCGC GATCCTGCCT AAAGATTCTT ACCAGACACT 2880

CCTTACAGAG AATTACACTC GCATATTTCT GAACATGACG GAGTCGACGC CCCTCGAGTT 2940

CACGCGGACG ATCCAGACCA GGATCGTAt\ AATCGAGGCC AGGCGCGCCT GCGCAGCTCA 3000

AGAGGCGGCG CCGGACATAT TCTTGGTGTT TSTTTCAGATG TTGGTGGCAC ACTTTCTTGT 3060 

TGCGCGGGGC ATTGCCGAGC ACCGATTTGT GGAGGTGGAC TGCGTGTGTC GGCAGTATGC 3120

GGAACTGTAT TTTCTCCGCC GCATCTCGCG TCTkTGCATG CCCACGTTCA CCACTGTCGG 3180 

GTATAACCAC ACCACCCTTG GCGCTGTGGC CGCCflCACAA ATAGCTCGCG TGTCCGCCAC 3240 

GAAGTTGGCC AGTTTGCCCC GCTCTTCCCA GGAAACAGTG CTGGCCATGG TCCAGCTTGG 3300

CGCCCGTGAT GGCGCCGTCC CTTCCTCCAT TCTGGAGGfec ATTGCTATGG TCGTCGAACA 3360 

TATGTATACC GCCTACACTT ATGTGTACAC ACTCGGCGAtVaCTGAAAGAA AATTAATGTT 3420

GGACATACAC ACGGTCCTCA CCGACAGCTG CCCGCCCAAA GACTCCGGAG TATCAGAAAA 3480

GCTACTGAGA ACATATTTGA TGTTCACATC AATGTGTACC AAQATAGAGC TGGGCGAAAT 3540 

GATCGCCCGC TTTTCCAAAC CGGACAGCCT TAACATCTAT AGGGOATTCT CCCCCTGCTT 3600

TCTAGGACTA AGGTACGATT TGCATCCAGC CAAGTTGCGC GCCGAGTCGC CGCAGTCGTC 3660 

CGCTCTGACG CGGACTGCCG TTGCCAGAGG AACATCGGGA TTCGCAGA&T TGCTCCACGC 3720

GCTGCACCTC GATAGCTTAA ATTTAATTCC GGCGATTAAC TGTTCAAAGAV TTACAGCCGA 3780

CAAGATAATA GCTACGGTAC CCTTGCCTCA CGTCACGTAT ATCATCAGTT CCGAAGCACT 3840

CTCGAACGCT GTTGTCTACG AGGTGTCGGA GATCTTCCTC AAGAGTGCCA TGTnTATATC 3900

TGCTATCAAA CCCGATTGCT CCGGCTTTAA CTTTTCTCAG ATTGATAGGC ACATTCCCAT 3960

AGTCTACAAC ATCAGCACAC CAAGAAGAGG TTGCCCCCTT TGTGACTCTG TAATCATGAG 4020

CTACGATGAG AGCGATGGCC TGCAGTCTCT CATGTATGTC ACTAATGAAA GGGTGCAC5AC 4080

CAACCTCTTT TTAGATAAGT CACCTTTCTT TGATAATAAC AACCTACACA TTCATTATTO 4140
GTGGCTGAGG GACAACGGGA CCGTAGTGGA GATAAGGGGC ATGTATAGAA GACGCGCAGcX 4200
CAGTGCTTTG TTTCTAATTC TCTCTTTTAT TGGGTTCTCG GGGGTTATCT ACTTTCTTTA 4260

CAGACTGTTT TCCATCCTTT ATTAGACGGT CAATAAAGCG TAGATTTTTA AAAGGTTTCC 4320 
TGTGCATTCT TTTTGTATGG GCATATACTT GGCAAGAAAT CCGAGCACCT CAGAAAGTGG 4380

ATTGCCGTCA CATATCAGTT CGACCACCCC TGCACCTAGC CATGCGGCGC TTTGACGGTC 4440

TTTGGGGCTA CACATCATAA AGTACTTTTC CATGGCTTCT ATAAGCACCT TGGAACAATC 4500

TGGGGGTTGG CGAATGGgW CCCTAAACGG GAAATCCTCT ATGGTATTCA GGCAGAAGAC 4560

CGCGTCCTCC ACCCGACGTTV TGAGTCTTTC TAGCAGAGCG CCGAAGAACT CCCGCTCGTG 4620 

TGTTTTCGCA GGGGCAAGTT OTGCGCCGTA CAGCGATGAG AAACACGACA CGATGTTTTC 4680

CAGCCCCATG CTGCGCAGCA ACACGTGCTT CAGGAACAGG TGTTGTAGCC GGTTCAGTTT 4740

TAGCTTGGGT AGAAAAGTTA TCGAGTTGTT AGCACGCTCC ATGATGGTAA CGGTGTTGAA 4800

GTCACAGACC GGGCTTTCTC CGAGTOTCGG CCGCCTGAGT CCAATCATGT AGAACATAGA 4860

CGCGGCCTCG TTGTCTGTGT TAAGTGaVaC GATATCCCGT TCGCAAACCT GTGCGATGTT 4920

GTGTTTCAGT ATAGATCTGG TCTGACCGGC ACGGGGTGTT ATGGGGTGAC GCGGTAAAGG 4980

CGACTCTGGG TCAAACACCT TTATGCGGtAgGCGGCCTCG TCGATGACGA CACGCTTGTT 5040

CGCGGCGTGT ATGGGGACGC GACGGCATCC OGCTGGCAGA TCTATAATCT TAAAGTTGGT 5100 

ATAAGACTGG TCGCTCGTTA TGGCCAGCCG GCACTCCGGT AGTATCTGCG TGTCCTCGAA 5160 

TTCGTGGCCG CGTACGACTG GCTTGGAGTG CAGOTAAACG CCAAGAGATG CGGTCTCTTC 5220 

GCCTACGCAC AAGTGGCTTC TTAACGCGTA GGGGTCCGGT GAGAGCATGA TCCGTAGCAA 5280

CGATAGTTCC GGGTGCCTAG CCGCGTAGAG TGGCAGOGTA GACGAGTCCG GAGTCCCAAA 5340

CTTTTCGAAC AACAGTGGCA TCGGGACTTC AGGATTAGAG ACTCCCACCA TGGCCGCCAC 5400

CGCCGGAGAG GTCAAGACGT GAAACACGCG CTCGCCTGTfC GACAGGCGCG CCGCGCCCTC 5460

TACTAGACTA GCCTTCACGT CCGGAACTCG TAACATAGCtVaGACCAGCG GACGGACGCA 5520

ACGTACGCGG GGATCGGCTG GCGGTGTCTG CTCGTTGGAC GtGGCCGTTC GGTGGCGCCA 5580 

GTGCAGGCCT AGTTTGCGAA TGGCGTGACG GACAATTTGT GgVtTTAGAG CGGCGAACCG 5640 

ATGACCCGTG GTGGCGACGA ACGAAATGAA GTTTGCATTG CGGOCCAACT CGTCTAGCCT 5700 

GGTCTTCTTG TTTCGGGCAT AGATTTTCGG GATTAGGTTA CACl"lViTAT ATCCCAGTAC 5760 

TGCGCACTCG TGTTTGCTTT TAGTGTGACT GATTATCTTC TTTGAGffiAGT CAAACAGGCC 5820 

CCGGGCGGCG GCTCGCCTAA TGCAAGCCAC GTCAAGCCTG AGAAACGaV: AGCATTCCAC 5880

CAGACACTCC AGGAACCTTT TGTGTAGCGT CTGTATTTGG GAACGGTTTCv TGTGCTCAAG 5940

TAGGGAGAAT ATTCTATTTT TGTTTCCGTC GATGCGCGCG TGCTGGTCCG TCAGAATGGG 6000 

CGCCAGCTCG TGGCGAATCT GTTCCACAAG AGGCTGCCCG TACACTTTAG AHATCGTGGC 6060 

TGTCGCGGCC TTAAACCAGG ACACGTTTAG CCCATCCTTG CTGGAGACCA CAGATGGAAA 6120 

GTTTGTGGTC CAAAATACGT TTTTTCGCCC CATTCTCACC ATGTACTGGT TTTCCJW3TCC 6180 

GTGCAGGTCC AACGTGGAGT TCCAATTTGC TATCGATACA GGAAATATGT GCCTGATO3G 6240

CAGAAAGCAT TTCAGCGTAC CCATTGCGAA GAGAAAGTGC AGCATGTCCC CACTGATGTT 6300
GATGTTTATT GCGGTGCCTT GACACATGTT GTCGGAAAAA AACACGCTTA TGGTAAAAGA 6360



WO 96/06159 



PCT/US95/10194 



AGGTTCCTTT 
AATAGTCTTT 
GGTGTTGAAT 
GTCTGACACT 
CGCCTGCAAA 
AAGCGTCGCC 
CAGTCCGCTG 
GATGGCCGGT 
CCTGTATGGC 
CTGGTTTAGA 
CTTACTCTGT 
GGCACGGCGC 
GTTAAACTTT 
GAGATGCTTC 
ATGGGGCAAC 
CCTGCACACT 
ATACAGCGAG 
TGATCCCACC 
ACCGTCAATT 
TTCTTGATGT 
GGCGAAAAAG 
CTGCAGGTGC 
TTGATAACAC 
GCCCTCGAGC 
GGAGGCCAAC 
TCAGCTATTG
CGTATATACC
CGTCAATACC
TTCAATGCCC
CGTCATGAAG
CATCGAGCTT
CATCAAGACG 
GGACACGGTT 
GGGCGATCCC 



ACGGAOTACT 
TGCAGGG 
ATGGTGAT 1 
GTAGAGCTGC 
TTTCCTTTCA 
GCCAGGAGAC 
CACGAGTGGC 
GTGCCCGGAT 
TTTGGTCCGG
GCCAGCTGAA 
TTCGACAGGT 
AGCGAGACCA 
AAAAAATGTA 
AGGCTCTCCA 
ACCGCCAGAA 
ATCCTGAACA 
CGTAGCTCCC 
GCAGGTAGGT
ATTAAACCTT 
ATAAAAGGGT 
GTGGGCTCGG 
GAGTTGGTGG 
GTGGTCTGCG 
GCTGCCATGG 
CTCCTAACGC 
CTCGGCAAGG 
AATGTGGTGG 
GAGTTCAAGG 
ACTATTGCCC 
GCTTGCAATA 
CTCTTCGCCG 
ATTACGTCGG 
CTCGCAGTTA 
GTCTACTCTG 



TTCGTATAAC 
TAGGAACGTG 
TGAAGTTTTC 
t CCAGAGTCCG 
1CTGCTCG 
ITGTCTC 

cgatNscagtc 

ACACT. 
GGTCCTT^GCG 
AGCCCAC 
TCTTCAGCA 
GCTCTCCGTG 
GCTGTGCGTC 
GGAGTGCAAA 
ACGCATGAAA 
TGGCTTTGTA 
TGAATTCGCA 
AGTTGTCGGT 
CGCCGCTGTA 
GGAGGCGTTC 
GCTGCATCAT 
CCAGACAGCA 
GACTCGTCGT 
AGGCGACCTT 
AGATTAAGGA 
ACGCCAGAGA 
AGTTTGTTAA 
ACCTGCGGAG 
ACGGAGACGG 
AGCACCACAT 
AGAAAGAGAC 
CTTTGCAGTT 
AACTTCGGCA 
AGAGGGGCCT 
AAAATTGTTG
GCAGCTTATC 
CIAAACTGACG 
CGCGTCCGTG 
CCGGTCTTTC 
GTGGGTGCCT 
TGCCACTGCC 
GTAGGTACAA 
TTGGATTTTT 
ATCCCGTCCG 
GGTGGGCAGT 



:cacccccac 



ACATA1 



T&GGGATGCG 
ATsATTTTGA 

acacVgttcg 

'GGTG 

GGGTrrAycA 

GTCTAT 
CCGTCGACC 
CCCCAGGAGT 
CTTATCAAGA 
GAATATTTCC 
CAGGGAGGCG 
GGAGCAACGA 
GTCGGCTGCC 
AGGCAGTGTC 
GTTTCTGGAG 
AATGATAGAT 
GAGGAGGCCC 
CGGTGCGGAG 
GCCCTTGGAC 
TGGTATGGAC 
CGCTCCACCC 
CAAAAAGGCC 



GTCAATCTGG 
TTAGTGTTAA 
TGTTTTGTGG 
GCCGCGTATC 
GGCGCGTACC 
AAAAAGTTTG 
ATACACATGA 
TCTGGGGTAC 
ACGTGCAGAC 
TTAACCTTGA
CGCTCTACGT 
GTGGCCATGA 
GGTGGCATTA 
TAGATTGTGG 
AACTCCCAGA 
CACGTTAGTA 
CAATCATCGG 
CCGCGCGTAA 
ACTTTTCCCA 



AGTCTGCGTA 
'CTAAGG 
ITGATT 
CTC^GTGGCA 
CCTTtCCCGT 
GAG 

CGTTTCGAAG 

accgccchcg 
ggaaaaataN 
aacaagcaga 1 



ICTCT 



ATTGAGCTTG 
TTCACAGAGT 
GCCCTAGAAC 
GTCTTTATTT 
GTCAAGTCTG 



GGATGTTTAA 
TCACCATGTT 
GTTCCAGCAT 
GTTGGAAGCA 
GGATTCTTGA 
CGCAGGGGTG 
CGAGTCTGTA
TGACGACCAC 
GGGACACGAG 
CGTCCTGGTG 
TGTGAGCGAT 
AGCTGCTGAT
TTGAAAACGA 
GTTGTAGACT 
ACTCCAGGTA
GCGCGGGAAG 
TAAGTTCCCA 
ACACTCCACC 
AAAGAGTCCC 
TCGCTCTGCA 
TCAGCTCTGC 
CCCAAGTCGC 
GTAGTAGGGG 
ACCTCGCCAC 
TCAAGAGCTT 
CGCTACTGGG 
CCGCCGCTTG
AGTTTAAAAT 
GACAGTATAT 
pGCCGCAGA 
^CGGGTGC 
ICTAGT 
TAAAdRCGCT 
ACATGG^ATC 



6420 

6480 

6540 

6600 

6660 

6720

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 
CATGTTCAAG GCACACCTCA TAGAACATTC ATTTTTTCTA GATAAGGCCG AGCTCATGAC 8460

AAGGGGGAAG CAGTATfeTCC TAACCATGCT CTCCGACATG CTGGCCGCGG TGTGCGAGGA 8520

TACCGTCTTT AAGGGTGTCA GCACGTACAC CACGGCCTCT GGGCAGCAGG TGGCCGGCGT 8580

CCTGGAGACG ACGGACAGCOy TCATGAGACG GCTGATGAAC CTGCTGGGGC AAGTGGAAAC3 8640

TGCCATGTCC GGGCCCGCGG VrTACGCCAG CTACGTTGTC AGGGGTGCCA ACCTCGTCAC 8700

CGCCGTTAGC TACGGAAGGG CGATGAGAAA CTTTGAACAG TTTATGGCAC GCATAGTGGA 8760

CCATCCCAAC GCTCTGCCGT CTGTJGGAAGG TGACAAGGCC GCTCTGGCGG ACGGACACGA 8820

CGAGATTCAG AGAACCCGCA TCGCCT3CCTC TCTCGTCAAG ATAGGGGATA AGTTTGTGGC 8880 

CATTGAAAGT TTGCAGCGCA TGTACAACGA GACTCAGTTT CCCTGCCCAC TGAACCGGCG 8940

CATCCAGTAC ACCTATTTCT TCCCTGTTfeG CCTTCACCTT CCCGTGCCCC GCTACTCGAC 9000

ATCCGTCTCA GTCAGGGGCG TAGAATCCC^ GGCCATCCAG TCGACCGAGA CGTGGGTGGT 9060

TAATAAAAAC AACGTGCCTC TTTGCTTCGG VtaCCAAAAC GCCCTCAAAA GCATATGCCA 9120 

CCCTCGAATG CACAACCCCA CCCAGTCAGC cVagGCACTA AACCAAGCTT TTCCCGATCC 9180 

CGACGGGGGA CATGGGTACG GTCTCAGGTA TGAGCAGACG CCAAACATGA ACCTATTCAG 9240

AACGTTCCAC CAGTATTACA TGGGGAAAAA CGTGfeCATTT GTTCCCGATG TGGCCCAAAA 9300

AGCGCTCGTA ACCACGGAGG ATCTACTGCA CCCM^CTCT CACCGTCTCC TCAGATTGGA 9360

GGTCCACCCC TTCTTTGATT TTTTTGTGCA CCCCTGTCCT GGAGCGAGAG GATCGTACCG 9420 

CGCCACCCAC AGAACAATGG TTGGAAATAT ACCACAAcVs CTCGCTCCAA GGGAGTTTCA 9480

GGAAAGTAGA GGGGCGCAGT TCGACGCTGT GACGAATATgX ACACACGTCA TAGACCAGCT 9540 

AACTATTGAC GTCATACAGG AGACGGCATT TGACCCCGCG VaTCCCCTGT TCTGCTATGT 9600 

AATCGAAGCA ATGATTCACG GACAGGAAGA AAAATTCGTG A^GAACATGC CCCTCATTGC 9660 

CCTGGTCATT CAAACCTACT GGGTCAACTC GGGAAAACTG GCgVtTGTGA ACAGTTATCA 9720

CATGGTTAGA TTCATCTGTA CGCATATTGG GAATGGAAGC ATCcVrAAGG AGGCGCACGG 9780 

CCACTACCGG AAAATCTTAG GCGAGCTCAT CGCCCTTGAG CAGGCOSCTTC TCAAGCTCGC 9840

GGGACACGAG ACGGTGGGTC GGACGCCGAT CACACATCTG GTTTCGQCTC TCCTCGACCC 9900 

GCATCTGCTG CCTCCCTTTG CCTACCACGA TGTCTTTACG GATCTTATCC AGAAGTCATC 9960

CAGACAACCC ATAATCAAGA TCGGGGATCA AAACTACGAC AACCCTCAaX ATAGGGCGAC 10020 

ATTCATCAAC CTCAGGGGTC GCATGGAGGA CCTAGTCAAT AACCTTGTTA VcATTTACCA 10080

GACAAGGGTC AATGAGGACC ATGACGAGAG ACACGTCCTG GACGTGGCGC COCTGGACGA 10140

GAATGACTAC AACCCGGTCC TCGAGAAGCT ATTCTACTAT GTTTTAATGC CGGTX3TGCAG 10200 

TAACGGCCAC ATGTGCGGTA TGGGGGTCGA CTATCAAAAC GTGGCCCTGA CGCTGACTTA 10260

CAACGGCCCC GTCTTTGCGG ACGTCGTGAA CGCACAGGAT GATATTCTAC TGCACCTGGA 10320

GAACGGAACC TTGAAGGACA TTCTGCAGGC AGGCGACATA CGCCCGACGG TGGACATGAX 10380

CAGGGTGCTG TGCACCTCGT TTCTGACGTG CCCTTTCGTC ACCCAGGCCG CTCGCGTGaA 10440
CACAAAGCGG GACCCGG&CC AGAGTTTTGC CACGCACGAA TACGGGAAGG ATGTGGCGCA 10500

GACCGTGCTT GTTAATGGOT TTGGTGCGTT CGCGGTGGCG GACCGCTCTC GCGAGGCGGC 10560


GGAGACTATG 


TTTTATCCGG 1 


< TACCCTTTAA 


CAAGCTCTAC 


GCTGAPPCGT 

w W X w^^W W W w X 


TGG TGG PTGP 

X ww X WOW X OW 




CACACTGCAT 


CCGCTCCTGC 


SAAACTATGT 


CACPJVGG PTP 

WA W WAV7W W X W 


PPPHAPPAGA 

W W WAA W WAOA 


GAAiPGPGGT 

OAAAWOWOw X 


J.UOOU 


GC5 T PTTT TLB. C* 


GTGPPATCCA 


ATC^TPATGGP 

n. X^. X X OOW 


AGAATATGAG 

AwAA X A X OA w 


wAA 1 OOWAWA 


ao X LuLLLbi 




CGPRGPGTAT 

WwWOOWO X ^1 ^ 




GTPAGGPPAP 

O X WAUOWWAW 


P P Pf2P.f2 CfZ P P 
WW W ooo wow W 


fiTTlrtPrtPPS 
AX lAwWwWWA 


X wO 1 OAwWAX 


t a q n n 


wLnv. W AAAAA 


W X AX W lu'teV.L 


PPAG*WTTPAT 
W WA w lu X WA X 


TTrtPPBPPPR 
X loWWAOOWA 


fcXPBPPPPIi 
AAAWA W W wWA 


1 OWAW W W 1 WO 


T H Q C A 

IU00U 


TTTTrtO p & Trz 


nw/\u x W o x wi 


ezcz a pfirsW pfs a 

OOA W O OA W OA 


OO X 1 W X AO WA 


OAOWAWA i WW 


1A1AL 1 ww X w 


T Q *> A 

1092 0 


W AOO o W o l Wo 


TPP ETrtT 


llul L»wwW\L X 


OWW 1 IWwwTw 


nm« ««««««« 

GTACGGCGCG 


AGGTACGTTC 


^ ft A O ft 

10980 


GGACGCGCTG 

w\Jnwww ww X w 


AW X X X X wAAA 


X 1 AW W WaWwA 


WAX WwW 1 X WW 


W 1 wWAWAWUw 


UlL x X wow 1 A 


T T ft J( A 

1104 0 


w 4 w-T% X U/IO X w 


AX WO WWW WOW 


WWWAWw 1V7UL 


WOWWAl AAW 1 


AWAO AWA 1 OO 


bAUIAUAl X G 


1 T T ft ft 

11100 


TCAGGACCTC 


T TT ATG ATTT 


TCCCAGGGGA 


PrePGTATPArt 

WwWw XAX WAV 


oaW Wow wao w 


X wWA X OAW X A 


liio U 


TATCAAAATG 


AAAGCGGGCG 


TG CAAACCGG 


CTHAPPGGGA 

w X WVkW W OOOf^ 


AAPAGAATGG 

AA WA OAA X WW 


A TP A PGTGGG 

A X WA W O X O WW 




ATACACTGCT 


GGGGTTCCTC 


GCTGCGAGAA 


CCTGttCCGGT 


TTGAGTCATG 


GTCAGPTGGP 
o x w^vow x oow 


1 1 280 

X X.4. O W 


AACCTGCGAG 


ATAATTCCCA 


CGCCGGTCAC 




GPPTATTTPP 

WWW 1A1 X X WW 


AGAPPPPPAG 

AOA W WWW WVJ 




CAACCCCCGG 


GGGCGTGCGG 


CGTCGGTCGT 


GTCGTGTSAT 


GPTTAPAGTA 

wW X X AWAO XA 


APGAAAGPGP 

AWOAAAOWOW 




AGAGCGTTTG 

**W*kW * ^ 4 


TTCTACGACC 


ATTCAATACC 


AGAPPPPGHG 

AwAW W W W w Wu 


TiPflllTRPP 

X AW OAA X O W W 


rtPTPPaPPAi 
OO X WWAWWAA 


1 lien 


C AAC C CGTGG 


GCTTCGCAGC 




PGGPG A PGTG * 
Ww\9WwAWw X o 


kPTATAPA ATA 
VW X AX aWAAX a 


x WaWW XXX Wo 


X, 13« U 


CC AG ACTGCG 


CTGCCGGGCA 


TGTACAGTPP 

X W X AWAU X WW 


TTGTPGGP A G 
X X o X WwwWAo 


iUti 1WWAWA 


AOOAAOA V_A X 


1 1 eon 


TATGCGGTAP 

X ^» «L wV»* WW X w 


AATAGGGGGT 


TflTlPlPPTT 

X w X A WAW XXX 


uul XaaIoao 


1 A \ X W 1 OWWA 


oow X X uL X oo 




GGCCCCCGCC 

ww w w w w w w^^# w 


ACCAGCACTA 


CAGACCTTCPA 


W X AWO X Wo X O 


O X WAA WwO X A 


P A f3 A Pf3Tf2TT 
WA-wAWO X 0 X X 


1 T TOO 


TTTGG AC C AG 


W W X X ow WiA x^% 


TGCTGCAGGA 

X WW X 0^_AwwA 


wOWW X AX WWW 


A PGP*Tnr*G PPG 
AWoW X tuLLu 


ppAGPPAPAr; 

W WAOW WAWAO 


11 / ou 


& ft' l"l 'A Tf5 ( " ! M 1 1 


G P PGAGTAPA 

O W W w AO X A WA 


TGTPAAAPAA 
X w X WAArlWAA 


OWAOAWAWAW 


OW W W WAUL 1 AW 


ALA X woo w WA 


1 T Q *5 A 


O X ni W x Wax X 


G A A G A GGTGG 

OAAOAww X WW 


PGP PGA TG A A 
W\l>WWwA.X waa 


OAwAW 1 A 1 X A 


AAOW 1 WOwlKA 


& PA ^ftftTftftT 
AWAAOO X OO X 


noon 
iiosu 


O X X X f^OW X n, 


f^W w w X X W X ^%w 


PG TTGGPT A f2 

Ww X X VJV3W X Aw 


1 WA X OwWAW 1 


Prt A P A a ^3 A f3T 

WwAWAAwAO X. 


AXAoXowX XA 




AC X X W'^WW X W 


P 1\ G A PTPTTP 
Lnwnv X W X X W 


GCTGATGAAr 

W/W X VXA X VJAAW 


TfirtPPflPPPT 
X OOWWOWWW 1 


TPAGTPAAAi ' 
X WAO X WAAAA 


Vatagggagpg 

V% X AO OO AO w 0 




InL X OWWOW X 


PGGAGATTGC 

WwOf%w4%X X OW 


P A PPGTTT A P 

W^^WWw X X 


AAAAT AT A PA 

AAAA X A X A WA 


GGP ATTGGG P 

ww WAX XwwOW 


pNpggggtgpg 

W X OO/O/O x owo 


12060 

X X w w U 


ini W lw X W^kWO 


TGAGAPATCT 

X wr%w^% wV X w X 




TCCAAATTAT 

X WWAArlX 


G PAGTATPTA 

O Wlw X X W X A 


TCaAAGTGCA 


12120 


CACTCGCTGT CCTGGAGGAG GTTCGCCCGG ACAGCCTGCG CCTAACGCGG ATGGATCCCT 12180

CTGACAACCT TCAGATAAAA AACGTATATG CCCCCTTTTT TCAGTGGGAC AGCAAOACCC 12240

AGCTAGCAGT GCTACCCCCA rrrri'TAGCc GAAAGGATTC CACCATTGTG CTCGAATtCA 12300

ACGGATTTGA CCCCGTGTTC CCCATGGTCG TGCCGCAGCA ACTGGGGCAC GCTATTCTGC 12360

AGCAGCTGTT GGTGTACCAC ATCTACTCCA AAATATCGGC CGGGGCCCCG gatgatgtaaX 12420

ATATGGCGGA ACTTGATCTA TATACCACCA ATGTGTCATT TATGGGGCGC ACATATCGTC 12480

TGGACGTAGA CAACrfCGGAT CCACGTACTG CCCTGCGAGT GCTTGACGAT CTGTCCATGT 12540

ACCTTTGTAT CCTATCAGCC TTGGTTCCCA GGGGGTGTCT CCGTCTGCTC ACGGCGCTCG 12600

TGCGGCACGA CAGGCATOCT CTGACAGAGG TGTTTGAGGG GGTGGTGCCA GATGAGGTGA 12660

CCAGGATAGA TCTCGACCACS TTGAGCGTCC CAGATGACAT CACCAGGATG CGCGTCATGT 12720

TCTCCTATCT TCAGAGTCTC Vgttctatat TTAATCTTGG CCCCAGACTG CACGTGTATG 12780

CCTACTCGGC AGAGACTTTG GOGGCCTCCT GTTGGTATTC CCCACGCTAA CGATTTGAAG 12840

CGGGGGGGGT ATGGCGTCAT CTGATATTCT GTCGGTTGCA AGGACGGATG ACGGCTCCGT 12900

CTGTGAAGTC TCCCTGCGTG GAGGT^GGAA AAAAACTACC GTCTACCTGC CGGACACTGA 12960

ACCCTGGGTG GTAGAGACCG ACGCCATCAA AGACGCCTTC CTCAGCGACG GGATCGTGGA 13020


TATGGCTCGA AAGCTTCATC GTGGTGCCOT GCCCTCAAAT TCTCACAACG GCTTGAGGAT 13080

GGTGCTTTTT TGTTATTGTT ACTTGCAAAA CTTGTGTGTAC CTAGCCCTGT TTCTGTGCCC 13140

CCTTAATCCT TACTTGGTAA CTCCCTCAAG CATTGAGTTT GCCGAGCCCG TTGTGGCACC 13200

TGAGGTGCTC TTCCCACACC CGGCTGAGAT GTCrCGCGGT TGCGATGACG CGATTTTCTG 13260

TAAACTGCCC TATACCGTGC CTATAATCAA CACCMGTTT GGACGCATTT ACCCGAACTC 13320

TACACGCGAG CCGGACGGCA GGCCTACGGA TTACTCCaTG GCCCTTAGAA GGGCTTTTGC 13380

AGTTATGGTT AACACGTCAT GTGCAGGAGT GACATTGTGC CGCGGAGAAA CTCAGACCGC 13440

ATCCCGTAAC CACACTGAGT GGGAAAATCT GCTGGCTATG YTTTCTGTGA TTATCTATGC 13500


CTTAGATCAC AACTGTCACC CGGAAGCACT GTCTATCGCG AGCGGCATCT TTGACGAGCG 13560

TGACTATGGA TTATTCATCT CTCAGCCCCG GAGCGTGCCC TCGOCTACCC CTTGCGACGT 13620

GTCGTGGGAA GATATCTACA ACGGGACTTA CCTAGCTCGG CCTGGAAACT GTGACCCCTG 13680

GCCCAATCTA TCCACCCCTC CCTTGATTCT AAATTTTAAA TAAAGGTETG TCACTGGTTA 13740

CACCACGATT AAAAACCACT CACTGAGATG TCTTTTTAAC CGCTAAGGGSA TTATACCGGG 13800

ATTTAAAACC GCCCACTGAT TTTTTTACGC TAAGAGTTGG GTGCTTGGGGi IGGTTTTGCAT 13860

TGCTCTGTTG TAAACTATAT ATAAGTTAAA CCAAAATTCG CAGGGAGACA AGGTGACGGT 13920

GGTGAGAACT CAGTTGAGAG TCAGAGAATA CAGTGCTAAT CAGGGTAGAT ga\catgact 13980




TTCCCCGTCT CCAGTCACCG GAGGAATGGT GGACGGCTCC GTCCTGGTGC GAATGGCCAC 14040

CAAGCCTCCC GTGATTGGTC TTATAACAGT GCTCTTCCTC CTAGTCATAG gcgccVscgt 14100

CTACTGCTGC ATTCGCGTGT TCCTGGCGGC TCGACTGTGG CGCGCCACCC CACTAGGCAG 14160

GGCCACCGTG GCGTATCAGG TCCTTCGCAC CCTGGGACCG CAGGCCGGGT CACATGCAOC 14220

GCCGACGGTG GGCATAGCTA CCCAGGAGCC CTACCGTACA ATATACATGC cagattagaa 14280

CGGGGTGTGT GCTATAATGG ATGGCTATGG GGGGGGGCTG TAGATAATTG AGCGCTGTGC 14340

TTTTATTGTG GGGATATGGG CTTGTACATG TGTCTATCAT CGGTAGCCAT AAAATGGGCC 14400

ATGACAACTG CCACAAGTAA GTCGTCCGAC ATGTGCTTTT GCTTGGCGCT GTATGACTGC 14460

CCTCCATCCC TAAGCGGGAC GCACTTGATC GCGCGGACCT GTTCTACCAG GTAGGTCACC 14520

GGGTCAAATG ATATTTTGAT GGTGTTGGAC ACCACCGTCT GGCTGGCGCT CAGGGTGCCG 14580

GAGTTCAGAG CGTAGATGAA TGTICTCAAAC GCGGAGGATT TCTCGCCTCC CAACATGTAA 14640

ATTGGCCACT GCAGGGCGCT GCTOTTGTCA GTATAGTGTA GAAAATGTAT GGGGAGCGGG 14700

CATATTTCGT TAAGGACGGT tgcaaYggcc ACCCCAGAAT CTTGGCTGCT GTTGCCTTCG 14760

ACCGCCGCGT TCACGCGCTC AATTGTOGTG TGGAGCACAG CGATCGCCTT AATCATCGTG 14820

CATGCGCAGG ACGCTATCTC GTAAGCAG(ET GCGCCAGTGA GGTCGCGCAG GAAGAAATGC 14880

TCCATGCCCA ATATGAGGCT TCTGGTGGGAX GTCTGAGTAC TCGTGACAAC GGCGCCCACG 14940

CCAGTACCGG ACGCCTCCGT GTTGTTCGTA TJACGCGGGGT CGATGTAAAC AAACAGCTGT 15000


TTTCCAAGGC ACTTCTGAAC CTCCTGGGCG GT1SGTGTCTA CCCGACACAT GTCAAACTGT 15060

GTCAGCGCTG CGTCACCCAC CACGCGGTAA AGCGTAGCAT TTGACGACGC TGCTCCCTCG 15120

CCCATTAGTT CGGTGTCGAA TGCCCCCTCC ATAAA<UVGGT TGGTGGTGGT TTTGATGGAT 15180

TCGTCGATGG TGATGTACGT CGGAATGTGC AGTCTGT&AC AAGGACAGGA CACTAGTGCG 15240

TCTTGCAGGT GGAAATCTTC TCGGTGGTCC GCACACACQT AACTGACCAC ATTCAGCATC 15300

TTTTCCTGGG CGTTCCTGAG GTTAAGCAGG AAACTCGTGGi kAGCGGTCTGA CGAGTTCACG 15360

GATGATATAA ATATAAGCTT GGCGTCTTTC TGAAGCATGA ACCCCAGAAT AGCCGGCAGT 15420

GCATCCTTTT TAATAAAATT CGCCTCGTCT ACGTAGAGCA GGTTAAAGGT CTGTCCCCGA 15480


ATGCTCTGCA GACACGGAAA GACACAAAAG AGGGGCTCAT AAGOGGCTAA CAGTAAAGGA 15540

GAGGAGGCGA ACAGTGCGTG GCTCTTGGTT CTTGGGAATA AAAGGGGGCG TGTGTGCCGA 15600

TCGATCGTAT GGGTGAGCCA GTGGATCCTG GACATGTGGT GAATGASAAA GATTTTGAGG 15660

AGTGTGAACA ATTTTTCAGT CAACCCCTTA GGGAGCAAGT GGTCGCGSGG GTCAGGGCAC 15720

TCGACGGCCT CGGTCTCGCT GACTCTCTAT GTCACAAAAC AGAAAGACTlC TGCCTGCTGA 15780

TGGACCTGGT GGGCACGGAG TGCTTTGCGA GGGTGTGCCG CCTAGACACC GGTGCGAAAT 15840

GAAGAGTGTG GCGAGTCCCT TATGTCAGTT CCACGGCGTG TTTTGCCTGT ACCAGTGTCG 15900

CCAGTGCCTG GCATACCACG TGTGTGATGG GGGCGCCGAA TGCGTTCTCC TGXZATACGCC 15960

GGAGAGCGTC ATCTGCGAAC TAACGGGTAA CTGCATGCTC GGCAACATTC AACjAGGGCCA 16020


GTTTTTAGGG CCGGTACCGT ATCGGACTTT GGATAACCAG GTTGACAGGG ACQCATATUi 16080

CGGGATGCTA GCGTGTCTGA AACGGGACAT TGTGCGGTAT TTGCAGACAT GGCCGQU*.CAC 16140

CACCGTAATC GTGCAGGAAA TAGCCCTGGG GGACGGCGTC ACCGACACCA TCTCGGCfCAT 16200

TATAGATGAA ACATTCGGTG AGTGTCTTCC CGTACTGGGG GAGGCCCAAG GCGGGTACGC 16260

CCTGGTCTGT AGCATGTATC TGCACGTTAT CGTCTCCATC TATTCGACAA aaacggtgta 16320

CAACAGTATG CTATTTAAAT GCACAAAGAA TAAAAAGTAC GACTGCATTG CCAAGCGGGT 16380

GCGGACAAAA TGGATGCGCA TGCTATCAAC GAAAGATACG TAGGTCCTCG CTGCCACCGT 16440

TTGGCCCACG TGGTGCTGCC TAGGACCTTT CTGCTGCATC ACGCCATACC CCTGGAGCCC 16500

GAGATCATCT TTTCCACCTA CACCCGGTTC AGCCGGTCGC CAGGGTCATC CCGCCGGTTG 16560
GTGGTGTGTG GGaWgW CCTGCCAGGG GAGGAAAACC AACTT.
GGTTTGGCGC TTA Xg\c TCTGTTTTCC ac^ZT 
TCGGTACTGC GOATtA^ CCCTGCTTCT AAT^ 

= =^= === 

——c — 

»»=»«, r™™ 

«=«« c„ 3: ™ —» ™~ 

—«» r™*" °" e " eH " 

—CSC TAGAGGCCGC ^ AalC ° A, " !0, """""^ 

~» «. TACGGG8GCT 3" ™— 

x**™ sccscccoc cxocC^ 3° ""'"^ """"^ 

CGCAAACTT CGGAATTTCC TAAACAAGGA ATGCATATGG ACT A™W „ 
«««« 

~TC \ /™ 

«™ OTITCTOra ^ v ^ 

soe^c «»» 

*"■"" mmr ™ — — «2E 

TOOM.BC CSTC^ ca«A» 

«™« c^ccs. c.^^ ^» 

«•«». »»CT^ CTOi(i ^ J 

««» .«„ ^^ ^^ 



16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
0 

18? 



CGAGAATGTG 
CCTGGGTCCT 
AACTTAAGGC 
CCGTCCTCCG 
GGGGGGGAAC 
CTTTTGGTGG 
GATCGCCACC 
TTGCTTAAAC 
CAATCCAAAC 
GTGCATTACT 
GTTCAGAAAG 
GTGCAGGGAG 
GACGTGCGCG 
GCGCGGCTTT 
TGACGTGGGC 
GTTTATTCTG 
TCGCGTGAAT 
CAGGTTGGCG 
CGTGTCCGAG 
GGACCTTAGC 
CCTCTTGGTT 
GCGGCCAGAC 
GCAGGGTTTC 
CGAAAAGGGC 
TCTGAATAAG 
GAAGGGTCTT 
CTCTTTTTGT 
GCTACTTCCA 
CCTGAAGTAC 
CCCGCCATTC 
TTCCGACACT 
AATTGCGGGT 
GTATATAACA 
CCCATCGTAT 



CTAAOAGGAG 
CTTACGAATG 
CCTGGGC 
GGTCGCGTG' 
CACCGACAAA 
GCTACGTACC 
ACTATCCATG 
GTCTGTAAAA 
AACTGGTGCG 
AG CTG CTGTT 
GACGAAAGGT 
TCGTGTGTTT 
TCCTCGCGCA 
GCGGGGACGT 
GTGGCGGGAT 
TGCACCACGC 
GTCGGGGCAC 
GCAAGGCGCT 
GGTGACCCGG 
GAGAACACAC 
TCCAACGTGA 
TTTAGTAATC 
GGGGCGGCGG 
CAGGTGTTCT 
GATGCGCTCC 
CTTCCTGACG 
CGGGCCGACT 
GGTCACATGT 
ACCAAGTTTC 
GCTACTTCTC 
TACATTTCCC 
CAGGGGACCC 
GGCAATGTTC 
GACATACCGG 



TGCTCGACGA 
TCTGACTACT 
CGTTCTTATT 
AGATTATGGT 
'CTTTAT 
iTGTTAAT 

tctXtccgtg 
cactgtttgg 

TCTT' 
GGAAGGGi 
TAACCAGAAA> 
TTCTGCCCGC 
CACACCGCAT 
TGTCCTCGGA 
GGTGCCGTGT 
CAATAAAAGG 
TCAGTTCCCA 
CCCTGTGACG 
AGGTGACCAG 
CTGGACAATT 
TGCC CGAGGT 
TCCACTTGCC 
GTGAGGAAAT 
ACAACCACTA 
TTCGGGAGGC 
ACCCCCTTCC 
GTTGCCTGTA 
ACGCTCCCAA 
TATACGGAGA 
GGATACAAAG 
ACACCTGCCT 
ACGTGGGTGG 
AGACCCAAAG 
CGATCATCAC 



GCAGTAQ 



187 

CGGCAC CTTG 
TCAGCCGCTT 
GAAGCATGTT 
TCCGTTCTCC 
GATTTCCGCG 
GTTCTCTACG 
TCTCCTTGGT 
AGTTTCAAAT 
GG C CTTG ATT 
CAG C CAGTGC 
L AG C CTGAAGT 
TGGTACTCG 
1CAAGTAT 

cggtVtgggg 

3TT 

GTGCGC( 
CCTCTCTC( 
GCTGAGCAGC 
TAGGTACGTC 
TAAGTTGATA 
CCAGCCAATC 
TAGACTGGAG 
CGCACTGGAC 
TGCTACCGAG 
TGTAGATGGC 
GTTG CTATGG 
CAAGCAG CAC 
ACGGGATCTT 
TTTTTCCGGG 
GGTAGTGAGT 
CTTGTGTCAC 
AATCCTACTG 
GTGTCCAACT 
CATGATCAAG 



GTGCCGTCCG 
GCTGATATAT 
GCGCACATCA 
TT CTTGATGT 
AACACGGAGT 
GATGCCAGTA 
ATTAGGAATA 
AAACCGAAGT 
GAAACCAAAA 
ACCCCGGGAA 
TCGCGGTAGA 
AC CAG TT GAT 
GTTGATAGGG 
GTTCCCACGT 
TCCAGGACCG 
CGTGCCGTTT 
GCGTCTTTGG 
TGTTTGCTT 
.GGGCGTAC 
lCTCCCC 

tgcAgtggcc 

AAGCTfcCAGA 

CCGTCTqACG 

GAGTGGACGT 

CTGTGTGACt 

CTGCTGTTCA 

TGCGGTTACC 

TTGTCGTTCG 

ACATGGG CGG 

CAGATGAAAA 

ATATATCAGC 

TTGAGTGGAA 

ACGGGCGACT 

GAGAATGGAC 



TCCAAGGCAC 
GAGTGTAAAA 
GCG AG CTGGA 
TTAAATTrTT 
TGGCTACGTG 
GCATGCTGAT 
CGCTTGCCTT 
ACTG CTTAAA 
AGAAAAAAGT 
CGTAACAGCC 
CAGAG CAGGC 
CGGCCGTGGA 
ACTCCAATAG 
CGGGATTTGC 
AACTGTATGA 
TGGGACAGTG 
CGGTCTCCTC 
TGAGCTCGCT 
AACTTGCCCT 
TGAACAGCTT 
GGCCGGCCTT 
GAGTC CTCGG 
TAGAAACACA 
GGGCTTTGAC 
CCGGAACTTG 
CGGACCCGC 
ICCCGGT 

ttAatcatgc 

CGGcN 
TCATAGATGC 
AAAATAGG^AT 
AAGGGACCC 
ATCTAATCAT 1 
TCAACCAACT 



18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19660 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
J0640 
CTAAAAGAGA gVttaTTAAG TCGGCTCTGG AGGCCAACAT CAACAGGAGG GCAGCTGTAT

CGCTATTTGA 1 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4131 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL:

(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY:

(B) LOCATION:

(D) OTHER

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:



ATG GAG GCG ACC TTG GAG CAA CGA 
Met Glu Ala Thr Leu Glu Gln Arg
1 5



TTC CCG TAC CTC GCC ACG GAG 
Phe Pro Tyr Leu Ala Thr Glu 

S E£ 25 Leu S SS S & S2 £ aT S 6 GAC ™ ™ 

2 0 iyS 25 S V Ala Ala Asp Gly Leu Phe 

SS S S2 S £S 2S S5 2SS £ D V AGA GA * « *» «« 

3 5 Leu G JJ L ^ S Arg Glu Gly Ser Val 

5 2 Si S 2 2 SJ !3 5 S 5% 2 2 JS 2 
w 512X2 £ s 2SSSS S^2 s 2 2 

£=2=222222222222 

SO \ 

2 5 S 2 2 2 22222 2 SteJ 2 2 

105 \10 

2 2 S 2 2 2 2222 SJ SJ 2 

120 

S 21 2 2 S S 2 2 2 2 2 2 2 2 
2 2 2 2 2 2 2 2 2 2 2 2 2 2 S*» 

150 155 -f^o 



GCG GAG 
Ala Glu 



GAG 
Glu 



20700 
20710 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
TCG GCT
Ser Ala 



ACG GTT 
Thr Val 



AAG ACG 
Lys Thr 



TTG CAG 
Leu Gln



CTC GCA 
Leu Ala 
180

CTG GGC 
Leu Gly 
195 



TTT GGT ATG
Phe Gly Met
165 



GTT 

Val L>< 



CTT 
Leu 



GAC GCC 
Asp Ala 



CGG CAC 
Arg His 
185 



CTA GAA CGG 
Leu Glu Arg 
170 



GGG CTA 
Gly Leu 



GCT CCA CCC GTC 
Ala Pro Pro Val 



GAT CCC 
Asp Pro 



TAC TCT 
Tyr Ser 
200 



GAG AGG GGC 
Glu Arg Gly 



CTC 
Leu 
205 



TTT 
Phe 
190 

AAA 

Lys 



GTG GAC 
Val Asp 
175 

ATT TTA 
Ile Leu



AAG GCC 
Lys Ala 



GTC AAG TCT GAC ATG GTA TCC ATG TTC AAG GCA CAC CTC ATA GAA CAT
Val Lys Ser Asp Met Val Ser Met Phe Lys Ala His Leu Ile Glu His
210 215 220



TCA 
Ser 
225 

GTC 
Val 



TTT 
Phe 



CTA 
Leu 



GTC TTT 
Val Phe 



GCC GGC 
Ala Gly 



CTG CTG 
Leu Leu 
290 

AGC TAC 
Ser Tyr 
305 

AGG GCG 
Arg Ala 



CCC AAC 
Pro Asn 



GGA CAC 
Gly His 



TTT CTA 
Phe Leu 



ACC ATG 
Thr Met 



AAG GGT 
Lys Gly 
260 

GTC CTG 
Val Leu 
275 

GGG GAA 
Gly Gln



GAT 
Asp 



CTC 
Leu 
245 

GTC 
Val 



AAG GCC 
Lys Ala 
230 

TCC GAC 
Ser Asp 



AGC ACG 
Ser Thr 



\.G CTC 
Glu Leu 



GAG ACG ACG 
Glu Thr Thr 



GTT GTC 
Val Val 



ATA 
Ile



GAG 
Glu 
385 

TTC 
Phe 



GGG 
Gly 
370 

ACT 
Thr 



TTC 
Phe 



GTC TCA 
Val Ser 



ATG AGA 
Met Arg 



GCT CTG 
Ala Leu 
340 

GAC GAG 
Asp Glu 
355 

GAT AAG 
Asp Lys 



CAG TTT 
Gln Phe



CCT GTT 
Pro Val 



GTC AGG 
Val Arg 
420 



GTG GAA AGT 
Val Glu Ser 
295 

AGG GGT GCC 
Arg Gly Ala 
310 

AAC TTT GAA 
Asn Phe Glu 
325 

CCG TCT GTG 
Pro Ser Val 



ATT CAG AGA 
Ile Gln Arg



TTT GTG GCC 
Phe Val Ala 
375 

CCC TGC CCA 
Pro Cys Pro 
390 

GGC CTT CAC 
Gly Leu His 
405 

GGC GTA GAA 
Gly Val Glu 



GAC AGC 
Asp Ser 
280 

GCC ATG 
Ala Met 



AAC CTC 
Asn Leu 



CAG TTT 
Gln Phe



GAA GGT 
Glu Gly 
345 

ACC CGC 
Thr Arg 
360 

ATT GAA 
Ile Glu



ATG ACA AGG 
Met Thr Arg 
235 

GCC GCG GTG 
Ala Ala Val 
250 

ACG GCC TCT 
Thr Ala Ser 



ATG AGA 
Met Arg 



TCd GGG CCC 
Ser Gly Pro
300 

GTC ACC GCC
Val Thr Ala

3: 

ATG GCA CGC
Met Ala 

330 

GAC AAG G 
Asp Lys 



ATC GCC GCC 
Ile Ala Ala



GGG AAG 
Gly Lys 



TGC GAG 
Cys Glu 



GGG CAG 
Gly Gln
270 

CGG CTG 
Arg Leu 
285 

GCG GCC 
Ala Ala 



CAG 
Gln



GAT 
Asp 
255 

CAG 
Gln



ATG AAC 
Met Asn 



TAC GCC 
Tyr Ala 




TAT 

Tyr 
240 

ACC 
Thr 



GTG 
Val 



CTG AAC 
Leu Asn 



CTT CCC 
Leu Pro 



TCC CCG 
Ser Pro 
425 



AGT TTG CAG 
Ser Leu Gin 
380 

CGG CGC ATC 
Arg Arg Ile
395 

GTG CCC CGC 
Val Pro Arg 
410 

GCC ATC CAG 
Ala Ile Gln



GTT AGC TAC GGA 
Val Ser Tyr Gly 
320 

ATA GTG GAC CAT 
Ile Val Asp His

335 

GCT CTG GCG GAC 
Ala Leu Ala Asp 
350 

CT CTC GTC AAG 
Ser Leu Val Lys
5 

CGS ATG TAC AAC 
Arg Met Tyr Asn



CAG 
Gln



\C ACC TAT 
Thr Tyr 
400 



TAC TCG ACA TCC
Tyr Ser Thr Ser 
5 

TCG ACC GAG ACG
Ser Thr Glu Thr 
430 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296
TGG GTG
Trp Val 



GTT AAT 
Val Asn Lys
435 



AAC AAC 
Asn Asn 



GTG 
Val 
440 



CCT CTT TGC 
Pro Leu Cys 



TTC GGT TAC 
Phe Gly Tyr 
445 



CAA AAC 
Gln Asn



1344 



GCC CTC 
Ala Leu 
450 



AAA AGC ATA 
Lys Ser Ile



!C CAC 
Cys His
455 



CCT CGA ATG CAC 
Pro Arg Met His 



AAC CCC ACC 
Asn Pro Thr 
460 



CAG TCA 
Gln Ser



1392 



GCC CAG 
Ala Gln
465 



GCA CTA AAC 
Ala Leu Asn 



CAA GCT TTT CCC GAT CCC
Gln Ala Phe Pro Asp Pro
470 475



GAC GGG GGA CAT 
Asp Gly Gly His 



GGG 
Gly 
480 



1440 



TAC GGT 
Tyr Gly 



CTC 
Leu 



AGG TAT 
Arg Tyr 
485 



GAG CAG 
Glu Gln



ACG CCA AAC ATG
Thr Pro Asn Met
490 



AAC CTA TTC 
Asn Leu Phe 



AGA 
Arg 
495 



ACG 
Thr 



1488 



TTC CAC 
Phe His 



CAG 
Gln



TAT TAC 
Tyr Tyr 
500 



ATG GGG 
Met Gly 



GTG GCA 
Asn Val Ala



TTT GTT CCC GAT GTG 
Phe Val Pro Asp Val 
510 



1536 



GCC CAA 
Ala Gln



CAC CGT 
His Arg 
530 



AAA 
Lys 
515 

CTC 
Leu 



GCG CTC 
Ala Leu 



CTC AGA 
Leu Arg 



GTA ACC ACG 
Val Thr Thr 
520 

TTG GAG GTC 
Leu Glu Val 
535 



GAT CTA 
Asp Leu 



CAC CCC TTC
His Pro Phe



CTG CAC CCA 
Leu His Pro 
525 

TTT GAT TTT 
Phe Asp Phe 
540



ACC TCT 
Thr Ser 



TTT GTG 
Phe Val 



1584 



1632 



CAC CCC 
His Pro 
545 



TGT CCT GGA 
Cys Pro Gly 



GCG AGA GGA 
Ala Arg Gly 
550 



TCG TAC CGC
Ser Tyr Arg
555



GCC ACC CAC 
Ala Thr His 



AGA ACA 
Arg Thr 
560 



1680 



ATG GTT 
Met Val 



GGA AAT ATA 
Gly Asn Ile
565 



CCA CAA CCG 
Pro Gln Pro



CTC GCT CC 
Leu Ala Pro
570 



AGG GAG TTT 
Arg Glu Phe



CAG GAA 
Gln Glu

575 



1728 



AGT AGA 
Ser Arg 



GGG GCG CAG 
Gly Ala Gln
580 



TTC GAC GCT 
Phe Asp Ala 



GTG ACG AAT 
Val Thr Asn 
585 



ATG ACA CAC
Met Thr His
590 



GTC ATA 
Val Ile



1776 



GAC CAG 
Asp Gln



CTA ACT ATT 
Leu Thr Ile
595 



GAC GTC ATA 
Asp Val Ile
600 



CAG GAG ACG 
Gln Glu Thr



GCA TTT GAC
Ala Phe Asp
605



CCC GCG 
Pro Ala 



1824 



TAT CCC 
Tyr Pro 
610 

GAA AAA 
Glu Lys 
625 

TAC TGG 
Tyr Trp 



GTT AGA 
Val Arg 



GCG CAC 
Ala His 



CAG GCG 
Gln Ala
690 



CTG TTC TGC 
Leu Phe Cys 



TTC GTG ATG 
Phe Val Met 



GTC AAC TCG 
Val Asn Ser 
645 

TTC ATC TGT 
Phe Ile Cys
660 

GGC CAC TAC 
Gly His Tyr 
675 

CTT CTC AAG 
Leu Leu Lys 



TAT GTA ATC 
Tyr Val Ile
615 

AAC ATG CCC 
Asn Met Pro 
630 

GGA AAA CTG 
Gly Lys Leu 



ACG CAT ATT 
Thr His Ile



CGG AAA ATC 
Arg Lys Ile
680 

CTC GCG GGA 
Leu Ala Gly 
695 



GAA GCA ATG 
Glu Ala Met 



CTC ATT GCC 
Leu Ile Ala
635 

GCG TTT GTG 
Ala Phe Val 
650 

GGG AAT GGA 
Gly Asn Gly 
665 

TTA GGC GAG 
Leu Gly Glu 



CAC GAG ACG 
His Glu Thr 



ATT CAfl GGA 
Ile His Gly
620 

CTG GTC 
Leu Val I ] 



AAC AGT T 
Asn Ser Tyr 



CAG GAA 
Gln Glu



CAA ACC 
Gln Thr
640 

CAC ATG 
His Met



AGC ATC CCT AAG GAG 
Ser Ile Pro Lys Glu
670 

CTC ATC GCC CTT GAG
Leu Ile Ala Leu Glu
685 

GTG GGT CGG ACG CC 
Val Gly Arg Thr 
700 



1872 



1920 



1968 



2016 



2064 



2112
s s s 2 sig s s s js g a s 2 s s

715 720 



TTT GCC TAC CAC GAT Gl 
Phe Ala Tyr His Asp 
725 



TTT ACG GAT CTT ATG CAG AAG TCA TCC AGA 
Phe Thr Asp Leu Met Gln Lys Ser Ser Arg
730 735



" - "i K S Efe ~ Si !£ ^ - - S - £ 

\ 745 750 

AGG GCG ACA TTC ATC AAC Ct\ AGG GGT CGC ATG GAG GAC CTA CTC 
Arg Ala Thr Phe He Asn LeuY Arg Gly Arg Met Glu Asp Leu VaT £n 

\760 765 



AAC CTT GTT AAC ATT TAC CAG ~, 
Asn Leu Val Asn He Tyr Gin Tl 
770 775 



AGG GTC AAT GAG GAC CAT GAC GAG 
Arg Val Asn Glu Asp His Asp Glu 
780 



a - ™ = ~ ~ s ss k i= s< s? x k 

? 95 eoo 



GTC CTC GAG AAG CTA TTC TAC TAT G1 
Val Leu Glu Lys Leu Phe Tyr Tyr 
805 

GGC CAC ATG TGC GGT ATG GGG GTC GAC 
Gly His Met Cys Gly Met Gly Val Asp 
820 825 

CTG ACT TAC AAC GGC CCC GTC TTT GCG GAC
Leu Thr Tyr Asn Gly Pro Val Phe Ala Asp 
835 840 



TTA ATG CCG GTG TGC AGT AAC 
Leu Met Pro Val Cys Ser Asn
810 815



CAA AAC GTG GCC CTG ACG 
Gln Asn Val Ala Leu Thr
830 

TC GTG AAC GCA CAG GAT 
Val Asn Ala Gln Asp
845 



" fil = 2 - = s " =? - ss£ ts a s as 
s s? ss s s s s s 25 s s s ss s 

870 875 \ ego 



CGC GTG ATC ACA 
Arg Val Ile Thr
895 

GGG AAG GAT 
Gly Lys Asp 
910 



TCG TTT CTG ACG TGC CCT TTC GTC ACC CAG GCC GC 
Ser Phe Leu Thr Cys Pro Phe Val Thr Gln Ala
885 890 

AAG CGG GAC CCG GCC CAG AGT TTT GCC ACG CAC GAA 
Lys Arg Asp Pro Ala Gln Ser Phe Ala Thr His Glu
900 905 

vl? 25? rf 3 5E C m GTT m mC m GGT GCG TTc\gCG GTG GCG 

Val Ala Gln Thr Val Leu Val Asn Gly Phe Gly Ala Phe Ala Val Ala
915 920 925

GAC CGC TCT CGC GAG GCG GCG GAG ACT ATG TTT TAT CCG GTA CCC TTT 
Asp Arg Ser Arg Glu Ala Ala Glu Thr Met Phe Tyr Pro Val Pro Phe
930 935 940 

AAC AAG CTC TAC GCT GAC CCG TTG GTG GCT GCC ACA CTG CaA CCG CTC 
Asn Lys Leu Tyr Ala Asp Pro Leu Val Ala Ala Thr Leu His Pro Leu
945 950 955 960

CTG CCA AAC TAT GTC ACC AGG CTC CCC AAC CAG AGA AAC GCG GTG GTC
Leu Pro Asn Tyr Val Thr Arg Leu Pro Asn Gln Arg Asn Ala Val Val
965 970 975



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



2640 



2688 



2736 



2784 



2832 



2880 



2928
TTT AAC GTG CCA TCOL AAT CTC ATG GCA GAA TAT GAG GAA TGG CAC AAG
Phe Asn Val Pro Ser Asn Leu Met Ala Glu Tyr Glu Glu Trp His Lys
980 985 990



TCG CCC GTC GCG GCG T/ 
Ser Pro Val Ala Ala 
995 



GCC GCG TCT TGT CAG GCC ACC CCG GGC GCC 
Ala Ala Ser Cys Gln Ala Thr Pro Gly Ala
1000 1005 



ATT AGC GCC ATG GTG AGC 
Ile Ser Ala Met Val Ser Met
1010 1015

ATT TGC CAG GCA AAA CAC CGC 
Ile Cys Gln Ala Lys His Arg
1025 1030 



CAC CAA AAA CTA TCT GCC CCC AGT TTC 
His Gln Lys Leu Ser Ala Pro Ser Phe
1020 

ATG CAC CCT GGT TTT GCC ATG ACA GTC
Met His Pro Gly Phe Ala Met Thr Val
1035 1040 



GTC AGG ACG GAC GAG GTT CTA 
Val Arg Thr Asp Glu Val Leu Al 
1045 



GCA GAG CAC ATC CTA TAC TGC TCC AGG
Glu His Ile Leu Tyr Cys Ser Arg
1050 1055 



GCG TCG ACA TCC ATG TTT GTG GGC TTfc CCT TCG GTG GTA CGG CGC GAG 
Ala Ser Thr Ser Met Phe Val Gly Leu Pro Ser Val Val Arg Arg Glu
1060 1065 1070

GTA CGT TCG GAC GCG GTG ACT TTT GAA ATT ACC CAC GAG ATC GCT TCC
Val Arg Ser Asp Ala Val Thr Phe Glu Ile Thr His Glu Ile Ala Ser
1075 1080 1085



CTG CAC ACC GCA CTT GGC TAC TCA TCA GTC 
Leu His Thr Ala Leu Gly Tyr Ser Ser Val I] 
1090 1095 

GCC GCC ATA ACT ACA GAC ATG GGA GTA CAT TGT 
Ala Ala Ile Thr Thr Asp Met Gly Val His Cys
1105 1110 1115 



GCC CCG GCC CAC GTG 
Ala Pro Ala His Val 
1100 

CAG GAC CTC TTT ATG
Gln Asp Leu Phe Met
1120 



ATT TTC CCA GGG GAC GCG TAT CAG GAC CGC CAG CTG CAT GAC TAT ATC
Ile Phe Pro Gly Asp Ala Tyr Gln Asp Arg Gln Leu His Asp Tyr Ile
1125 1130 1135



AAA ATG AAA GCG GGC GTG CAA ACC GGC TCA CCG GGA 
Lys Met Lys Ala Gly Val Gln Thr Gly Ser Pro Gly
1140 1145 

CAC GTG GGA TAC ACT GCT GGG GTT CCT CGC TGC GAG AAC 
His Val Gly Tyr Thr Ala Gly Val Pro Arg Cys Glu Asn 
1155 1160 1165



AGA ATG GAT 
Arg Met Asp 

1150

CCC GGT 
Pro Gly 



TTG AGT CAT GGT CAG CTG GCA ACC TGC GAG ATA ATT CCC ACG CCG GTC
Leu Ser His Gly Gln Leu Ala Thr Cys Glu Ile Ile Pro Thr Pro Val
1170 1175 1180

ACA TCT GAC GTT GCC TAT TTC CAG ACC CCC AGC AAC CCC CGG GGB CGT 
Thr Ser Asp Val Ala Tyr Phe Gln Thr Pro Ser Asn Pro Arg Gly Arg
1185 1190 1195 1200

GCG GCG TCG GTC GTG TCG TGT GAT GCT TAC AGT AAC GAA AGC GCA 
Ala Ala Ser Val Val Ser Cys Asp Ala Tyr Ser Asn Glu Ser Ala C 
1205 1210 1215 

CGT TTG TTC TAC GAC CAT TCA ATA CCA GAC CCC GCG TAC GAA TGC CGG 
Arg Leu Phe Tyr Asp His Ser Ile Pro Asp Pro Ala Tyr Glu Cys Arg
1220 1225 1230 

TCC ACC AAC AAC CCG TGG GCT TCG CAG CGT GGC TCC CTC GGC GAC GTG 
Ser Thr Asn Asn Pro Trp Ala Ser Gln Arg Gly Ser Leu Gly Asp Val
1235 1240 1245 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



3360 



3408 



3456 



3504 



3552 



3600 



3648 



3696 



WO 96/06159 



PCT/US95/10194 






Leu Tyr Asn lie T? 

1250 \ 1255 



1265 



Gly Leu Tyr Thr Leu Val 1 
1285 



Pro Ala Thr Ser Thr Thr As^ 

1300 \ 1305 

GAC GTG TTT TTG GAC CAG CCT TGC
Asp Val Phe Leu Asp Gln Pro Cys
1315 1320



1330 



1335 



AAG CAG ACA CAC GCC CCA GTA CAC 
Lys Gln Thr His Ala Pro Val His
1345 1350 



ATG
Met



GTG GCG CCG ATG AAG AGA CTA TTA AAG 
Val Ala Pro Met Lys Arg Leu Leu Lys
1365 

TAG 



GCG 
Ala 


CTG 
Leu 


CCG GGC 
Pro Gly 
1260 


ATG 
Met 


TAC 
Tyr 


AGT 
Ser 


GAC 
Asp 


ATT ATG 
Ile Met
1275 


CGG 
Arg 


TAC 
Tyr 


AAT 
Asn 


AGG 
Arg 
1280 


TCT GCC 
Ser Ala 
1290 


AGG 
Arg 


CTT 
Leu 


GCT 
Ala 


GGG GCC 
Gly Ala 
1295 


TAC 
Tyr 


GTC 
Val 


GTG 
Val 


GTC 
Val 


AAC GGT 
Asn Gly 
1310 


ACA 
Thr 


ATG 
Met 


CTG 
Leu 


CAG 
Gln


GAG GCC 
Glu Ala 
1325 


TAT 
Tyr 


CCC 
Pro 


CTT 
Leu 


GCC 
Ala 


GAG TAC 
Glu Tyr 
1340 


ATG 
Met 


TCA 
Ser 


AAC 
Asn 


GGC 
Gly 


CAG TAT 
Gln Tyr
1355 


CTC 
Leu 


ATT 
Ile


GAA 
Glu 


GAG 
Glu 
1360 


CTC GGA AAC 
Leu Gly Asn
1370 


AAG 
Lys 


GTG 
Val 


GTG TAT 
Val Tyr 
1375 



3792 



3840 



3888 



3936 



3984 



4032 



4080 



4128 



4131 



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1376 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



Met 
1 

Ala 

Lys 

Arg 

Lys 
65 

Lys 
Met 



Glu Ala Thr Leu Glu Gln Arg Pro Phe Pro Tyr Leu
5 10 

Asn Leu Leu Thr Gln Ile Lys Glu Ser Ala Ala
20 25 

Ser Phe Gin Leu Leu Leu Gly Lys Asp Ala Arg 

35 40 4 



Ala Thr Glu 
15 

Gly Leu Phe 
30 



Phe Glu Ala Leu Leu Gly Val Tyr Thr Asn Val Val 
50 55 60 

Phe Leu Glu Thr Ala Leu Ala Ala Ala Cys Val Asn 

70 75 

Asp Leu Arg Arg Met Ile Asp Gly Lys Ile Gln Phe
85 90 

Pro Thr Ile Ala His Gly Asp Gly Arg Arg Pro Asn
100 105 



GlV Gly Ser Val 
lu Phe Val 



Glu Phe 
80 

Lys Ile Ser
5 

Lys Gl\ Arg 
110 



i; a 



rSi
«- ^ s . val y t Lys Ala ^ ^ Lys 4 His ^ ^ ^ ^



Ile Glu Leu Ala
130 



Al\ 



Thr Pro Leu Asp Phe 
145 

Ser Ala Leu Gln Phe
165 



Ala Asp He Glu Leu Leu P he Ala Glu Lys Mu 

140 



rhr 



0 G1 " ^ Ma «• ^ Thr He Thr 

155 _ _ A 

160 

Met Asp Ala u» Glu Arg Gly Leu Val Asp 

0 175 



Thr Val Leu Ala Val Lys Leu Arg His Ala Pro Pro Val Phe Ile Leu

180 185 190



Lys Thr Leu Gly Asp Pro 
195 



Tyr Ser Glu Arg Gly Leu Lys Lys Ala 



Val Lys Ser Asp Met Val Ser Met Phe Lys Ala His Leu Ile Glu His



Ser Phe Phe Leu Asp Lys Ala 
" 5 230 



Leu Met Thr Arg Gly Lys Gln Tyr

240 



Val Leu Thr Met Leu Ser As« m~A\ 

Asp MetXLeu Ala Ala Val Cys Glu Asp. Thr 

Val Phe Lys Gly Val Ser Thr Tyr Vr Thr *i e 

260 Thr Ala Ser Gly Gin Gin Val 

Ala Gly Val Leu Glu Thr th^ » _ \ \ 

Thr Thr Asp se^val Met Arg Arg Leu Met Asn 

Leu Leu Gly Gin Val Glu Ser Ala Met 
y ° 295 



*er 



Gly Pro Ala Ala Tyr Ala 

V 300 



«« Tyr V a l Va i ^ ^ ^ ^ ^ V ^ ^ ^ ^ ^ ^ 
Ala Met ^ ^ phe Qlu V ^ ^ ^ »0 



335 



Pro Asn Ala Leu Pro Ser Val Glu Gly Ast> iA an 

340 AU Lyi Ala Ala Leu Ala Asp 

Gly His Asp Glu Xle Gin Ar g Thr Ar 9 xle Ala U |e r Leu Val Lys 
XL «y Asp Lys Phe Val Ala Xle Clu Ser Leu o\Z Met ^ Asn 

«u Thr «. Phe Pro Cys Pro Leu Asn Ar g Ar f *A a ^ Thr Tyr 

J95 X 400 



Phe Phe Pro Val Gly Leu His Leu Pro Val Pro Arg

US 410 



Ser Thr Ser 
415 



Val Ser Val Arg Gly Val Glu Ser Pro Ala II* rin c V 

42 0 42 1 6r % r Glu T *ir 

Trp val Val Asn Lys Asn Asn Val Pro Leu Cys Phe Gly ^ 0 

440 44 | 



Gin Asn 



Ala Leu Lys Ser Ile Cys His Pro Arg Met His Asn Pro Thr Gln Ser
Ala Gln Ala Leu Asn Gln Ala Phe Pro Asp Pro Asp Gly Gly His Gly



Ox* 



4s
475



480 



465 \ M .t- Asn Leu Phe Arg Thr 

irfl Tvr GliAGln Thr Pro Asn Met Asn Le ^ 
Tyr Gly Leu Arg Tyr giux 490 

\ * S n val Ala Phe Val Pro Asp Val 

Mot- gav Lys Asn vai S10 
Phe His Gin Tyr Tyr Met G*y y ^ 

500 \ His Pro Thr Ser 

r - « - «■ A - - a w "" v " 

His Arg Leu Leu Arg n ^ x 

i Ala Arg Gly\er Tyr Arg Ala Thr His Arg Thr 
His Pro cys Pro Gly Ala Arg Gly ^ 555 



545 



A Pro Arg Glu Phe Gin Glu 

Met Val Gly Asn Ile Pro Gln Pro Leu Ala

, Al a Gin Phe Asp Ala ValVhr Asn Met Thr Hi. Val He 

Ser Arg Gly Ala Gin *ne * S Q5 \ 

Z Xle ~, Val Xle Gin \« Thr Ala Phe Asp Pro Ala 

Asp Gin Leu Thr lie &Q0 ^ 

595 



0 _ rvs Tyr Val He Glu Al 
Tyr Pro Leu Phe Cys Tyr ^ 

610 



.Met 



Ile His Gly Gln Glu
620 



61° \-, val He Gin Thr 

«. le n Met Pro Leu He fl£a Leu Val ±a g4Q 

i~.v.a. Met Asn Met fj.^> *- v_ 

Glu Lys Phe Val wet ^ 6 ^> 

ser Gly Lys Leu Ala Phe VaUsn Ser Tyr His Me, 
Tyr Trp Val Asn Ser Gly Ly g50 X 

Ile ^ Tbr His He Gly Asn Gly W He Pro Lys Glu 

Val Arg Phe He Cys in* 665 \ 

660 \ -no, ala Leu Glu 

■ Glv His Tyr Arg Lys He Leu Gly Glu ll- 

Ala His Gly His =» g80 \ 

=1 „ Z „ - - « - - - * V ~ - " 

x , », « ~ » - - - - s - V "° ; " 

7 0S , Mftt Gln LysXSer Ser Arg 

• val Phe Thr Asp Leu Met Gin uy \ 73& 

Phe Ala Tyr His Asp Val *n 730 \ 

a«i pVo Gin Asn 

«. « « .» «• « - s - p 
„ Z « - - « « - - 5 " w "v 1 

Arg Ala Thr Phe lie ~= 7gQ \ 

™ - » w - - - v " - - - V al ° 

Asn Leu Val Asn "775 \ 

val Ala Pro Leu Asp Glu Asn Asp Tyr As* Pro 
Arg His val Leu Asp Val Ala 7.S \ 

Phe Tyr Tyr Val Leu Met Pro val Cys serW 
val Leu Glu Lys Leu Phe Tyr Tyr ^ y 

Gly Me, Gly val Asp Tyr Gin Asn Val Ala Leu *hr 
Gly His Met Cys Gly Met y ^ \ 

3 820 \
Leu Thr Tyr Asn Gly Pro Val Phe Ala Asp Val Val Asn Ala Gln Asp
835 840 845



Asp Ile Leu Leu
850

Leu Glu Asn Gly Thr Leu Lys Asp Ile Leu Gln
855 860

Ala Gly Asp Ile Arg Pro Thr Val Asp Met Ile Arg Val Leu Cys Thr
865 870 875 880

Ser Phe Leu Thr Cys Pro Phe Val Thr Gln Ala Ala Arg Val Ile Thr
885 890 895

Lys Arg Asp Pro Ala Gln Ser Phe Ala Thr His Glu Tyr Gly Lys Asp
900 905 910

Val Ala Gln Thr Val Leu Val Asn Gly Phe Gly Ala Phe Ala Val Ala
915 920 925

Asp Arg Ser Arg Glu Ala Ala Glu Thr Met Phe Tyr Pro Val Pro Phe
930 935 940

Asn Lys Leu Tyr Ala Asp Pro Leu Val Ala Ala Thr Leu His Pro Leu
945 950 955 960

Leu Pro Asn Tyr Val Thr Arg Leu Pro Asn Gln Arg Asn Ala Val Val
965 970 975

Phe Asn Val Pro Ser Asn Leu Met Ala Glu Tyr Glu Glu Trp His Lys
980 985 990

Ser Pro Val Ala Ala Tyr Ala Ala Ser Cys Gln Ala Thr Pro Gly Ala
995 1000 1005

Ile Ser Ala Met Val Ser Met His Gln Lys Leu Ser Ala Pro Ser Phe
1010 1015 1020

Ile Cys Gln Ala Lys His Arg Met His Pro Gly Phe Ala Met Thr Val
1025 1030 1035 1040

Val Arg Thr Asp Glu Val Leu Ala Glu His Ile Leu Tyr Cys Ser Arg
1045 1050 1055

Ala Ser Thr Ser Met Phe Val Gly Leu Pro Ser Val Val Arg Arg Glu
1060 1065 1070

Val Arg Ser Asp Ala Val Thr Phe Glu Ile Thr His Glu Ile Ala Ser
1075 1080 1085

Leu His Thr Ala Leu Gly Tyr Ser Ser Val Ile Ala Pro Ala His Val
1090 1095 1100

Ala Ala Ile Thr Thr Asp Met Gly Val His Cys Gln Asp Leu Phe Met
1105 1110 1115 1120

Ile Phe Pro Gly Asp Ala Tyr Gln Asp Arg Gln Leu His Asp Tyr Ile
1125 1130 1135

Lys Met Lys Ala Gly Val Gln Thr Gly Ser Pro Gly Asn Arg Met Asp
1140 1145 1150

His Val Gly Tyr Thr Ala Gly Val Pro Arg Cys Glu Asn Leu Pro Gly
1155 1160 1165

Leu Ser His Gly Gln Leu Ala Thr Cys Glu Ile Ile Pro Thr Pro Val
1170 1175 1180

Thr Ser Asp Val Ala Tyr Phe Gln Thr Pro Ser Asn Pro Arg Gly Arg
1185 1190 1195 1200

Ala Ala Ser Val Val Ser Cys Asp Ala Tyr Ser Asn Glu Ser Ala Glu
1205 1210 1215

Arg Leu Phe Tyr Asp His Ser Ile Pro Asp Pro Ala Tyr Glu Cys Arg
1220 1225 1230

Ser Thr Asn Asn Pro Trp Ala Ser Gln Arg Gly Ser Leu Gly Asp Val
1235 1240 1245

Leu Tyr Asn Ile Thr Phe Arg Gln Thr Ala Leu Pro Gly Met Tyr Ser
1250 1255 1260

Pro Cys Arg Gln Phe Phe His Lys Glu Asp Ile Met Arg Tyr Asn Arg
1265 1270 1275 1280

Gly Leu Tyr Thr Leu Val Asn Glu Tyr Ser Ala Arg Leu Ala Gly Ala
1285 1290 1295

Pro Ala Thr Ser Thr Thr Asp Leu Gln Tyr Val Val Val Asn Gly Thr
1300 1305 1310

Asp Val Phe Leu Asp Gln Pro Cys His Met Leu Gln Glu Ala Tyr Pro
1315 1320 1325

Thr Leu Ala Ala Ser His Arg Val Met Leu Ala Glu Tyr Met Ser Asn
1330 1335 1340

Lys Gln Thr His Ala Pro Val His Met Gly Gln Tyr Leu Ile Glu Glu
1345 1350 1355 1360

Val Ala Pro Met Lys Arg Leu Leu Lys Leu Gly Asn Lys Val Val Tyr
1365 1370 1375



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1143
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

AGC ATT CGG GGA CAG ACC TTT AAC CTG CTC TAC GTA GAC GAG GC 

Ser Ile Arg Gly Gln Thr Phe Asn Leu Leu Tyr Val Asp Glu Ala
1 5 10 

TTT ATT AAA AAG GAT GCA CTG CCG GCT ATT CTG GGT TTC ATG CTT 

Phe Ile Lys Lys Asp Ala Leu Pro Ala Ile Leu Gly Phe Met Leu
20 25 30 



AAT 
Asn 



48 



96
AAA GAC GCC
Lys Asp Ala 
35 

TCC ACG AGT 
Ser Thr Ser 
50 

AAT GTG GTC 
Asn Val Val 
65 

CAA GAC GCA 
Gin Asp Ala 



TAC ATC ACC 
Tyr Ile Thr



GAG GGG GCA 
Glu Gly Ala 
115 

AAT GCT ACG 
Asn Ala Thr 
130 

GAC ATG TGT 
Asp Met Cys 
145 

GGA AAA CAG 
Gly Lys Gln



GAG GCG TCC 
Glu Ala Ser 



CCC ACC AGA 
Pro Thr Arg 
195 

CTC ACT GGC 
Leu Thr Gly 
210 

ATT AAG GCG 
Ile Lys Ala
225 

GCG GTC GAA 
Ala Val Glu 



GTC CTT AAC 
Val Leu Asn 



GAC AAG AGC 
Asp Lys Ser 
275 

AAA TCC TCC 
Lys Ser Ser 
290 



AAG CTT ATA TTT 
Lys Leu Ile Phe



TTC CTG CTT AAC
Phe Leu Leu Asn
55 

AGT TAC GTG 
Ser Tyr Val 
70 

CTA GTG TCC TGT
Leu Val Ser Cys 
85 

ATC GAC GAA TCC 
Ile Asp Glu Ser

100 

TTC GAC ACC GAA 
Phe Asp Thr Glu 



ATA TCA 
Ile Ser
40 

CTC AGG 
Leu Arg 



GCG GAC 
Ala Asp 



CCT TGT
Pro Cys



CTT TAC CGC GTG 
Leu Tyr Arg Val 
135 



CTA ATG
Leu Met
120 

GTG GGT 
Val Gly 



TCC GTG 
Ser Val 



AAC GCC 
Asn Ala 



CAC CGA 
His Arg 
75 

TAC AGA 
Tyr Arg 
90 

ACC ACC 
Thr Thr 



GGC GAG 
Gly Glu 



GAC GCA
Asp Ala



AAC TCG TCA 
Asn Ser Ser 
45 

CAG GAA AAG 
Gln Glu Lys
60 

GAA GAT TTC 
Glu Asp Phe 



GAC CGC 
Asp Arg 



ATG CTG 
Met Leu 



CTG CAC ATT 
Leu His Ile



ACC AAC CTC 
Thr Asn Leu 
110 

GGA GCA GCG 
Gly Ala Ala 
125 

GCG CTG ACA 
Ala Leu Thr 
140 



CAC CTG 
His Leu 
80 

CCG ACG 
Pro Thr 
95 

TTT ATG 
Phe Met 



TCG TCA 
Ser Ser 



CAG TTT 
Gln Phe



GCA GCT 
Ala Ala 



ATC GCT 
Ile Ala



GGC AAC 
Gly Asn 
245 

GAA ATA 
Glu Ile
260 

AGC GCC 
Ser Ala 




CGG GTA GAC ACC 
Arg Val Asp Thr 
150 

CTG TTT GTT TAC 
Leu Phe Val Tyr 
165 

GGT ACT GGC GTG 
Gly Thr Gly Val 
180 

AGC CTC ATA TTG 
Ser Leu Ile Leu



ACC GCC 
Thr Ala 



GCT 
Ala 



GTG 
Val 
230 

AGC 
Ser 



TAC 
Tyr 
215 

CTC 
Leu 



AGC 
Ser 



ATC GAC 
Ile Asp



GGC GCC 
Gly Ala 
185 

GGC ATG 
Gly Met 
200 

GAG ATA 
Glu Ile



GTT CAG AAG 
Val Gln Lys



CCC GCiB TAT ACG AAC 
Pro Ala Tyr Thr Asn
170 



GTT GTC 
Val Val 



AGT ACT 
Ser Thr 
190 



TGC CCG 
Cys Pro 



CTG CAG 
Leu Gln



GCG TTT GAG ACA 
Ala Phe Glu Thr 
295 



CAC ACC 
His Thr 



CAA GAT 
Gln Asp



CTC CCC 
Leu Pro 
265 

TGG CCA 
Trp Pro 
280 

TTC ATC 
Phe Ile



GAG CAT TTC TTC CTG
Glu His Phe Phe Leu 
15 

GCG TCC TGC GCA TGC 
Ala Ser Cys Ala 
220 

ACA ATT GAG CGC 
Thr Ile Glu Arg Val
235 

TCT GGG GTG GCC ATT
Ser Gly Val Ala Ile
250 

ATA CAT TTT CTA CAC 
Ile His Phe Leu His
270 

ATT TAC ATG TTG GGA 
Ile Tyr Met Leu Gly
285

TAC GCT CTG AAC TCC 
Tyr Ala Leu Asn Ser 
300 



TGC CTT 
Cys Leu 
160 

AAC ACG 
Asn Thr 
175 

CAG ACT 
Gin Thr 



CGC GAC 
Arg Asp 



ACG ATG 
Thr Met 



AAC GCG 
Asn Ala 
240 



GGC GAG 
Gly Glu 



\ 



GGC ACG 
Gly Thr 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912
CTG AGC
Leu Ser 
305 

GAC CCG 
Asp Pro 



GCC AGC CAG ACG GTG GTG TCC
Ala Ser Gln Thr Val Val Ser
310

GTG AC&TAC CTG GTA GAA CAG 
Val Thr Tyr Leu Val Glu Gln
325



AAC ACC 
Asn Thr 
315 

GTC CGC 
Val Arg 
330 



ATC 
Ile



GCG 
Ala 



AAA ATA 
Lys Ile



ATC AAG 

Ile Lys



TCA TTT 
Ser Phe 

320 

TGC GTC 
Cys Val 
335 



CCG CTT AGG GAT GGA GGG CAG TCA TAC AGC GCC AAG CAA AAG CAC ATG
Pro Leu Arg Asp Gly Gly Gln Ser Tyr Ser Ala Lys Gln Lys His Met
340 345 350



TCG GAC 
Ser Asp 



GAT GAT 
Asp Asp 
370 



GAC TTA CTT GTG GCA GTT GTC ATG GCC
Asp Leu Leu Val Ala Val Val Met Ala
355 360

AGA CAC ATG TAC AAG CCC ATA TCC CCA
Arg His Met Tyr Lys Pro Ile Ser Pro



CAT 
His 



CAA 
Gln
380 



TTT ATG 
Phe Met 
365 

TAA 



GCT ACC 
Ala Thr 



960 



1008 



1056 



1104 



1143 



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 380 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ser Ile Arg Gly Gln Thr Phe Asn Leu Leu Tyr Val Asp Glu Ala Asn
1 5 10 15

Phe Ile Lys Lys Asp Ala Leu Pro Ala Ile Leu Gly Phe Met Leu Gln
20 25 30

Lys Asp Ala Lys Leu Ile Phe Ile Ser Ser Val Asn Ser Ser Asp Arg
35 40 45

Ser Thr Ser Phe Leu Leu Asn Leu Arg Asn Ala Gln Glu Lys Met Leu
50 55 60

Asn Val Val Ser Tyr Val Cys Ala Asp His Arg Glu Asp Phe His Leu
65 70 75 80

Gln Asp Ala Leu Val Ser Cys Pro Cys Tyr Arg Leu His Ile Pro Thr
85 90 95

Tyr Ile Thr Ile Asp Glu Ser Ile Lys Thr Thr Thr Asn Leu Phe Met
100 105 110

Glu Gly Ala Phe Asp Thr Glu Leu Met Gly Glu Gly Ala Ala Ser Ser
115 120 125

Asn Ala Thr Leu Tyr Arg Val Val Gly Asp Ala Ala Leu Thr Gln Phe
130 135 140

Asp Met Cys Arg Val Asp Thr Thr Ala Gln Glu Val Gln Lys Cys Leu
145 150 155 160

Gly Lys Gln Leu Phe Val Tyr Ile Asp Pro Ala Tyr Thr Asn Asn Thr
165 170 175

Glu Ala Ser Gly Thr Gly Val Gly Ala Val Val Thr Ser Thr Gln Thr
180 185 190

Pro Thr Arg Ser Leu Ile Leu Gly Met Glu His Phe Phe Leu Arg Asp
195 200 205

Leu Thr Gly Ala Ala Ala Tyr Glu Ile Ala Ser Cys Ala Cys Thr Met
210 215 220

Ile Lys Ala Ile Ala Val Leu His Thr Thr Ile Glu Arg Val Asn Ala
225 230 235 240

Ala Val Glu Gly Asn Ser Ser Gln Asp Ser Gly Val Ala Ile Ala Thr
245 250 255

Val Leu Asn Glu Ile Cys Pro Leu Pro Ile His Phe Leu His Tyr Thr
260 265 270

Asp Lys Ser Ser Ala Leu Gln Trp Pro Ile Tyr Met Leu Gly Gly Glu
275 280 285

Lys Ser Ser Ala Phe Glu Thr Phe Ile Tyr Ala Leu Asn Ser Gly Thr
290 295 300

Leu Ser Ala Ser Gln Thr Val Val Ser Asn Thr Ile Lys Ile Ser Phe
305 310 315 320

Asp Pro Val Thr Tyr Leu Val Glu Gln Val Arg Ala Ile Lys Cys Val
325 330 335

Pro Leu Arg Asp Gly Gly Gln Ser Tyr Ser Ala Lys Gln Lys His Met
340 345 350

Ser Asp Asp Leu Leu Val Ala Val Val Met Ala His Phe Met Ala Thr
355 360 365

Asp Asp Arg His Met Tyr Lys Pro Ile Ser Pro Gln
370 375 380



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..234 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATG GGT GAG CCA GTG GAT CCT GGA CAT GTG GTG AAT GAG AAA GAT TTT 48
Met Gly Glu Pro Val Asp Pro Gly His Val Val Asn Glu Lys Asp Phe
1 5 10 15

GAG GAG TGT GAA CAA TTT TTC AGT CAA CCC CTT AGG GAG CAA GTG GTC 96
Glu Glu Cys Glu Gln Phe Phe Ser Gln Pro Leu Arg Glu Gln Val Val
20 25 30

GCG GGG GTC AGG GCA CTC GAC GGC CTC GGT CTC GCT GAC TCT CTA TGT 144
Ala Gly Val Arg Ala Leu Asp Gly Leu Gly Leu Ala Asp Ser Leu Cys
35 40 45

CAC AAA ACA GAA AGA CTC TGC CTG CTG ATG GAC CTG GTG GGC ACG GAG 192
His Lys Thr Glu Arg Leu Cys Leu Leu Met Asp Leu Val Gly Thr Glu
50 55 60

TGC TTT GCG AGG GTG TGC CGC CTA GAC ACC GGT GCG AAA TGA 234
Cys Phe Ala Arg Val Cys Arg Leu Asp Thr Gly Ala Lys
65 70 75

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Gly Glu Pro Val Asp Pro Gly His Val Val Asn Glu Lys Asp Phe
1 5 10 15

Glu Glu Cys Glu Gln Phe Phe Ser Gln Pro Leu Arg Glu Gln Val Val
20 25 30

Ala Gly Val Arg Ala Leu Asp Gly Leu Gly Leu Ala Asp Ser Leu Cys
35 40 45

His Lys Thr Glu Arg Leu Cys Leu Leu Met Asp Leu Val Gly Thr Glu
50 55 60

Cys Phe Ala Arg Val Cys Arg Leu Asp Thr Gly Ala Lys
65 70 75



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..585 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG AAG AGT GTG GCG AGT CCC TTA TGT CAG TTC CAC GGC GTG TTT TGC 48
Met Lys Ser Val Ala Ser Pro Leu Cys Gln Phe His Gly Val Phe Cys
1 5 10 15

CTG TAC CAG TGT CGC CAG TGC CTG GCA TAC CAC GTG TGT GAT GGG GGC 96
Leu Tyr Gln Cys Arg Gln Cys Leu Ala Tyr His Val Cys Asp Gly Gly
20 25 30

GCC GAA TGC GTT CTC CTG CAT ACG CCG GAG AGC GTC ATC TGC GAA CTA 144
Ala Glu Cys Val Leu Leu His Thr Pro Glu Ser Val Ile Cys Glu Leu
35 40 45

ACG GGT AAC TGC ATG CTC GGC AAC ATT CAA GAG GGC CAG TTT TTA GGG 192
Thr Gly Asn Cys Met Leu Gly Asn Ile Gln Glu Gly Gln Phe Leu Gly
50 55 60

CCG GTA CCG TAT CGG ACT TTG GAT AAC CAG GTT GAC AGG GAC GCA TAT 240
Pro Val Pro Tyr Arg Thr Leu Asp Asn Gln Val Asp Arg Asp Ala Tyr
65 70 75 80

CAC GGG ATG CTA GCG TGT CTG AAA CGG GAC ATT GTG CGG TAT TTG CAG 288
His Gly Met Leu Ala Cys Leu Lys Arg Asp Ile Val Arg Tyr Leu Gln
85 90 95

ACA TGG CCG GAC ACC ACC GTA ATC GTG CAG GAA ATA GCC CTG GGG GAC 336
Thr Trp Pro Asp Thr Thr Val Ile Val Gln Glu Ile Ala Leu Gly Asp
100 105 110

GGC GTC ACC GAC ACC ATC TCG GCC ATT ATA GAT GAA ACA TTC GGT GAG 384
Gly Val Thr Asp Thr Ile Ser Ala Ile Ile Asp Glu Thr Phe Gly Glu
115 120 125

TGT CTT CCC GTA CTG GGG GAG GCC CAA GGC GGG TAC GCC CTG GTC TGT 432
Cys Leu Pro Val Leu Gly Glu Ala Gln Gly Gly Tyr Ala Leu Val Cys
130 135 140

AGC ATG TAT CTG CAC GTT ATC GTC TCC ATC TAT TCG ACA AAA ACG GTG 480
Ser Met Tyr Leu His Val Ile Val Ser Ile Tyr Ser Thr Lys Thr Val
145 150 155 160

TAC AAC AGT ATG CTA TTT AAA TGC ACA AAG AAT AAA AAG TAC GAC TGC 528
Tyr Asn Ser Met Leu Phe Lys Cys Thr Lys Asn Lys Lys Tyr Asp Cys
165 170 175

ATT GCC AAG CGG GTG CGG ACA AAA TGG ATG CGC ATG CTA TCA ACG AAA 576
Ile Ala Lys Arg Val Arg Thr Lys Trp Met Arg Met Leu Ser Thr Lys
180 185 190

GAT ACG TAG 585
Asp Thr
195

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Lys Ser Val Ala Ser Pro Leu Cys Gln Phe His Gly Val Phe Cys
1 5 10 15

Leu Tyr Gln Cys Arg Gln Cys Leu Ala Tyr His Val Cys Asp Gly Gly
20 25 30

Ala Glu Cys Val Leu Leu His Thr Pro Glu Ser Val Ile Cys Glu Leu
35 40 45

Thr Gly Asn Cys Met Leu Gly Asn Ile Gln Glu Gly Gln Phe Leu Gly
50 55 60

Pro Val Pro Tyr Arg Thr Leu Asp Asn Gln Val Asp Arg Asp Ala Tyr
65 70 75 80

His Gly Met Leu Ala Cys Leu Lys Arg Asp Ile Val Arg Tyr Leu Gln
85 90 95

Thr Trp Pro Asp Thr Thr Val Ile Val Gln Glu Ile Ala Leu Gly Asp
100 105 110

Gly Val Thr Asp Thr Ile Ser Ala Ile Ile Asp Glu Thr Phe Gly Glu
115 120 125

Cys Leu Pro Val Leu Gly Glu Ala Gln Gly Gly Tyr Ala Leu Val Cys
130 135 140

Ser Met Tyr Leu His Val Ile Val Ser Ile Tyr Ser Thr Lys Thr Val
145 150 155 160

Tyr Asn Ser Met Leu Phe Lys Cys Thr Lys Asn Lys Lys Tyr Asp Cys
165 170 175

Ile Ala Lys Arg Val Arg Thr Lys Trp Met Arg Met Leu Ser Thr Lys
180 185 190

Asp Thr

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 939 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..939
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG GCT AGC CGG AGG CGC AAA CTT CGG AAT TTC CTA AAC AAG GAA TGC 48
Met Ala Ser Arg Arg Arg Lys Leu Arg Asn Phe Leu Asn Lys Glu Cys
1 5 10 15

ATA TGG ACT GTT AAC CCA ATG TCA GGG GAC CAT ATC AAG GTC TTT AAC 96
Ile Trp Thr Val Asn Pro Met Ser Gly Asp His Ile Lys Val Phe Asn
20 25 30

GCC TGC ACC TCT ATC TCG CCG GTG TAT GAC CCT GAG CTG GTA ACC AGC 144
Ala Cys Thr Ser Ile Ser Pro Val Tyr Asp Pro Glu Leu Val Thr Ser
35 40 45

TAC GCA CTG AGC GTG CCT GCT TAC AAT GTG TCT GTG GCT ATC TTG CTG 192
Tyr Ala Leu Ser Val Pro Ala Tyr Asn Val Ser Val Ala Ile Leu Leu
50 55 60

CAT AAA GTC ATG GGA CCG TGT GTG GCT GTG GGA ATT AAC GGA GAA ATG 240
His Lys Val Met Gly Pro Cys Val Ala Val Gly Ile Asn Gly Glu Met
65 70 75 80

ATC ATG TAC GTC GTA AGC CAG TGT GTT TCT GTG CGG CCC GTC CCG GGG 288
Ile Met Tyr Val Val Ser Gln Cys Val Ser Val Arg Pro Val Pro Gly
85 90 95

CGC GAT GGT ATG GCG CTC ATC TAC TTT GGA CAG TTT CTG GAG GAA GCA 336
Arg Asp Gly Met Ala Leu Ile Tyr Phe Gly Gln Phe Leu Glu Glu Ala
100 105 110

TCC GGA CTG AGA TTT CCC TAC ATT GCT CCG CCG CCG TCG CGC GAA CAC 384
Ser Gly Leu Arg Phe Pro Tyr Ile Ala Pro Pro Pro Ser Arg Glu His
115 120 125

GTA CCT GAC CTG ACC AGA CAA GAA TTA GTT CAT ACC TCC CAG GTG GTG 432
Val Pro Asp Leu Thr Arg Gln Glu Leu Val His Thr Ser Gln Val Val
130 135 140

CGC CGC GGC GAC CTG ACC AAT TGC ACT ATG GGT CTC GAA TTC AGG AAT 480
Arg Arg Gly Asp Leu Thr Asn Cys Thr Met Gly Leu Glu Phe Arg Asn
145 150 155 160

GTG AAC CCT TTT GTT TGG CTC GGG GGC GGA TCG GTG TGG CTG CTG TTC 528
Val Asn Pro Phe Val Trp Leu Gly Gly Gly Ser Val Trp Leu Leu Phe
165 170 175

TTG GGC GTG GAC TAC ATG GCG TTC TGT CCG GGT GTC GAC GGA ATG CCG 576
Leu Gly Val Asp Tyr Met Ala Phe Cys Pro Gly Val Asp Gly Met Pro
180 185 190

TCG TTG GCA AGA GTG GCC GCC CTG CTT ACC AGG TGC GAC CAC CCA GAC 624
Ser Leu Ala Arg Val Ala Ala Leu Leu Thr Arg Cys Asp His Pro Asp
195 200 205

TGT GTC CAC TGC CAT GGA CTC CGT GGA CAC GTT AAT GTA TTT CGT GGG 672
Cys Val His Cys His Gly Leu Arg Gly His Val Asn Val Phe Arg Gly
210 215 220

TAC TGT TCT GCG CAG TCG CCG GGT CTA TCT AAC ATC TGT CCC TGT ATC 720
Tyr Cys Ser Ala Gln Ser Pro Gly Leu Ser Asn Ile Cys Pro Cys Ile
225 230 235 240

AAA TCA TGT GGG ACC GGG AAT GGA GTG ACT AGG GTC ACT GGA AAC AGA 768
Lys Ser Cys Gly Thr Gly Asn Gly Val Thr Arg Val Thr Gly Asn Arg
245 250 255

AAT TTT CTG GGT CTT CTG TTC GAT CCC ATT GTC CAG AGC AGG GTA ACA 816
Asn Phe Leu Gly Leu Leu Phe Asp Pro Ile Val Gln Ser Arg Val Thr
260 265 270

GCT CTG AAG ATA ACT AGC CAC CCA ACC CCC ACG CAC GTC GAG AAT GTG 864
Ala Leu Lys Ile Thr Ser His Pro Thr Pro Thr His Val Glu Asn Val
275 280 285

CTA ACA GGA GTG CTC GAC GAC GGC ACC TTG GTG CCG TCC GTC CAA GGC 912
Leu Thr Gly Val Leu Asp Asp Gly Thr Leu Val Pro Ser Val Gln Gly
290 295 300

ACC CTG GGT CCT CTT ACG AAT GTC TGA 939
Thr Leu Gly Pro Leu Thr Asn Val
305 310

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Ala Ser Arg Arg Arg Lys Leu Arg Asn Phe Leu Asn Lys Glu Cys
1 5 10 15

Ile Trp Thr Val Asn Pro Met Ser Gly Asp His Ile Lys Val Phe Asn
20 25 30

Ala Cys Thr Ser Ile Ser Pro Val Tyr Asp Pro Glu Leu Val Thr Ser
35 40 45

Tyr Ala Leu Ser Val Pro Ala Tyr Asn Val Ser Val Ala Ile Leu Leu
50 55 60

His Lys Val Met Gly Pro Cys Val Ala Val Gly Ile Asn Gly Glu Met
65 70 75 80

Ile Met Tyr Val Val Ser Gln Cys Val Ser Val Arg Pro Val Pro Gly
85 90 95

Arg Asp Gly Met Ala Leu Ile Tyr Phe Gly Gln Phe Leu Glu Glu Ala
100 105 110

Ser Gly Leu Arg Phe Pro Tyr Ile Ala Pro Pro Pro Ser Arg Glu His
115 120 125

Val Pro Asp Leu Thr Arg Gln Glu Leu Val His Thr Ser Gln Val Val
130 135 140

Arg Arg Gly Asp Leu Thr Asn Cys Thr Met Gly Leu Glu Phe Arg Asn
145 150 155 160

Val Asn Pro Phe Val Trp Leu Gly Gly Gly Ser Val Trp Leu Leu Phe
165 170 175

Leu Gly Val Asp Tyr Met Ala Phe Cys Pro Gly Val Asp Gly Met Pro
180 185 190

Ser Leu Ala Arg Val Ala Ala Leu Leu Thr Arg Cys Asp His Pro Asp
195 200 205

Cys Val His Cys His Gly Leu Arg Gly His Val Asn Val Phe Arg Gly
210 215 220

Tyr Cys Ser Ala Gln Ser Pro Gly Leu Ser Asn Ile Cys Pro Cys Ile
225 230 235 240

Lys Ser Cys Gly Thr Gly Asn Gly Val Thr Arg Val Thr Gly Asn Arg
245 250 255

Asn Phe Leu Gly Leu Leu Phe Asp Pro Ile Val Gln Ser Arg Val Thr
260 265 270

Ala Leu Lys Ile Thr Ser His Pro Thr Pro Thr His Val Glu Asn Val
275 280 285

Leu Thr Gly Val Leu Asp Asp Gly Thr Leu Val Pro Ser Val Gln Gly
290 295 300

Thr Leu Gly Pro Leu Thr Asn Val
305 310



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 86 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..86
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

ATG GAC TCA ACC AAC TCT AAA AGA GAG TTT ATT AAG TCG GCT CTG GAG 48
Met Asp Ser Thr Asn Ser Lys Arg Glu Phe Ile Lys Ser Ala Leu Glu
1 5 10 15

GCC AAC ATC AAC AGG AGG GCA GCT GTA TCG CTA TTT GA 86
Ala Asn Ile Asn Arg Arg Ala Ala Val Ser Leu Phe
20 25



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Asp Ser Thr Asn Ser Lys Arg Glu Phe Ile Lys Ser Ala Leu Glu
1 5 10 15

Ala Asn Ile Asn Arg Arg Ala Ala Val Ser Leu Phe
20 25



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1743 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1743
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

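ATG GCA GAA GGC GGT TTT GGA GCG GAC TCG GTG GGG CGC GGC GGA GAA 48
Met Ala Glu Gly Gly Phe Gly Ala Asp Ser Val Gly Arg Gly Gly Glu
1 5 10 15

AAG GCC TCT GTG ACT AGG GGA GGC AGG TGG GAC TTG GGG AGC TCG GAC 96
Lys Ala Ser Val Thr Arg Gly Gly Arg Trp Asp Leu Gly Ser Ser Asp
20 25 30

GAC GAA TCA AGC ACC TCC ACA ACC AGC ACG GAT ATG GAC GAC CTC CCT 144
Asp Glu Ser Ser Thr Ser Thr Thr Ser Thr Asp Met Asp Asp Leu Pro
35 40 45

GAG GAG AGG AAA CCA CTA ACG GGA AAG TCT GTA AAA ACC TCG TAC ATA 192
Glu Glu Arg Lys Pro Leu Thr Gly Lys Ser Val Lys Thr Ser Tyr Ile
50 55 60

TAC GAC GTG CCC ACC GTC CCG ACC AGC AAG CCG TGG CAT TTA ATG CAC 240
Tyr Asp Val Pro Thr Val Pro Thr Ser Lys Pro Trp His Leu Met His
65 70 75 80

GAC AAC TCC CTC TAC GCA ACG CCT AGG TTT CCG CCC AGA CCT CTC ATA 288
Asp Asn Ser Leu Tyr Ala Thr Pro Arg Phe Pro Pro Arg Pro Leu Ile
85 90 95

CGG CAC CCT TCC GAA AAA GGC AGC ATT TTT GCN NNT CGG TTG TCA GCG 336
Arg His Pro Ser Glu Lys Gly Ser Ile Phe Ala Ser Arg Leu Ser Ala
100 105 110

ACT GAC GAC GAC TCG GGA GAC TAC GCG CCA ATG GAT CGC TTC GCC TTT 384
Thr Asp Asp Asp Ser Gly Asp Tyr Ala Pro Met Asp Arg Phe Ala Phe
115 120 125

CAG AGC CCC AGG GTG TGT GGT CGC CCT CCC CTT CCG CCT CCA AAT CAC 432
Gln Ser Pro Arg Val Cys Gly Arg Pro Pro Leu Pro Pro Pro Asn His
130 135 140

CCA CCT CCG GCA ACT AGG CCG GCA GAC GCG TCA ATG GGG GAC GTG GGC 480
Pro Pro Pro Ala Thr Arg Pro Ala Asp Ala Ser Met Gly Asp Val Gly
145 150 155 160

TGG GCG GAT CTG CAG GGA CTC AAG AGG ACC CCA AAG GGA TTT TTA AAA 528
Trp Ala Asp Leu Gln Gly Leu Lys Arg Thr Pro Lys Gly Phe Leu Lys
165 170 175

ACA TCT ACC AAG GGG GGC AGT CTC AAA GCC CGT GGA CGC GAT GTA GGT 576
Thr Ser Thr Lys Gly Gly Ser Leu Lys Ala Arg Gly Arg Asp Val Gly
180 185 190

GAC CGT CTC AGG GAC GGC GGC TTT GCC TTT AGT CCT AGG GGC GTG AAA 624
Asp Arg Leu Arg Asp Gly Gly Phe Ala Phe Ser Pro Arg Gly Val Lys
195 200 205

TCT GCN ATA GGG CAA AAC ATT AAA TCA TGG TTG GGG ATC GGA GAA TCA 672
Ser Ala Ile Gly Gln Asn Ile Lys Ser Trp Leu Gly Ile Gly Glu Ser
210 215 220

TCG GCG ACT GCT GTC CCC GTC ACC ACG CAG CTT ATG GTA CCG GTG CAC 720
Ser Ala Thr Ala Val Pro Val Thr Thr Gln Leu Met Val Pro Val His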
225 230 235 240 

CTC ATT AGA ACG CCT GTG ACC GTG GAC TAC AGG AAT GTT TAT TTG CTT 768
Leu Ile Arg Thr Pro Val Thr Val Asp Tyr Arg Asn Val Tyr Leu Leu
245 250 255

TAC TTA GAG GGG GTA ATG GGT GTG GGC AAA TCA ACG CTG GTC AAC GCC 816
Tyr Leu Glu Gly Val Met Gly Val Gly Lys Ser Thr Leu Val Asn Ala
260 265 270

GTG TGC GGG ATC TTG CCC CAG GAG AGA GTG ACA AGT TTT CCC GAG CCC 864
Val Cys Gly Ile Leu Pro Gln Glu Arg Val Thr Ser Phe Pro Glu Pro
275 280 285

ATG GTG TAC TGG ACG AGG GCA TTT ACA GAT TGT TAC AAG GAA ATT TCC 912
Met Val Tyr Trp Thr Arg Ala Phe Thr Asp Cys Tyr Lys Glu Ile Ser
290 295 300

CAC CTG ATG AAG TCT GGT AAG GCG GGA GAC CCG CTG ACG TGT GCC AAA 960
His Leu Met Lys Ser Gly Lys Ala Gly Asp Pro Leu Thr Ser Ala Lys
305 310 315 320

ATA TAC TCA TGC CAA AAC AAG TTT TCG CTC CCC TTC CGG ACG AAC GCC 1008
Ile Tyr Ser Cys Gln Asn Lys Phe Ser Leu Pro Phe Arg Thr Asn Ala
325 330 335

ACC GCT ATC CTG CGA ATG ATG CAG CCC TGG AAC GTT GGG GGT GGG TCT 1056
Thr Ala Ile Leu Arg Met Met Gln Pro Trp Asn Val Gly Gly Gly Ser
340 345 350

GGG AGG GGC ACT CAC TGG TGC GTC TTT GAT AGG CAT CTC CTC TCC CCA 1104
Gly Arg Gly Thr His Trp Cys Val Phe Asp Arg His Leu Leu Ser Pro
355 360 365

GCA GTG GTG TTC CCT CTC ATG CAC CTG AAG CAC GGC CGC CTA TCT TTT 1152
Ala Val Val Phe Pro Leu Met His Leu Lys His Gly Arg Leu Ser Phe
370 375 380

GAT CAC TTC TTT CAA TTA CTT TCC ATC TTT AGA GCC ACA GAA GGC GAC 1200
Asp His Phe Phe Gln Leu Leu Ser Ile Phe Arg Ala Thr Glu Gly Asp
385 390 395 400

GTG GTC GCC ATT CTC ACC CTC TCC AGC GCC GAG TCG TTG CGG CGG GTC 1248
Val Val Ala Ile Leu Thr Leu Ser Ser Ala Glu Ser Leu Arg Arg Val
405 410 415

AGG GCG AGG GGA AGA AAG AAC GAC GGG ACG GTG GAG CAA AAC TAC ATC 1296
Arg Ala Arg Gly Arg Lys Asn Asp Gly Thr Val Glu Gln Asn Tyr Ile
420 425 430

AGA GAA TTG GCG TGG GCT TAT CAC GCC GTG TAC TGT TCA TGG ATC ATG 1344
Arg Glu Leu Ala Trp Ala Tyr His Ala Val Tyr Cys Ser Trp Ile Met
435 440 445

TTG CAG TAC ATC ACT GTG GAG CAG ATG GTA CAA CTA TGC GTA CAA ACC 1392
Leu Gln Tyr Ile Thr Val Glu Gln Met Val Gln Leu Cys Val Gln Thr
450 455 460

ACA AAT ATT CCG GAA ATC TGC TTC CGC AGC GTG CGC CTG GCA CAC AAG 1440
Thr Asn Ile Pro Glu Ile Cys Phe Arg Ser Val Arg Leu Ala His Lys
465 470 475 480

GAG GAA ACT TTG AAA AAC CTT CAC GAG CAG AGC ATG CTA CCT ATG ATC 1488
Glu Glu Thr Leu Lys Asn Leu His Glu Gln Ser Met Leu Pro Met Ile
485 490 495

ACC GGT GTA CTG GAT CCC GTG AGA CAT CAT CCC GTC GTG ATC GAG CTT 1536
Thr Gly Val Leu Asp Pro Val Arg His His Pro Val Val Ile Glu Leu
500 505 510

TGC TTT TGT TTC TTC ACA GAG CTG AGA AAA TTA CAA TTT ATC GTA GCC 1584
Cys Phe Cys Phe Phe Thr Glu Leu Arg Lys Leu Gln Phe Ile Val Ala
515 520 525

GAC GCG GAT AAG TTC CAC GAC GAC GTA TGC GGC CTG TGG ACC GAA ATC 1632
Asp Ala Asp Lys Phe His Asp Asp Val Cys Gly Leu Trp Thr Glu Ile
530 535 540

TAC AGG CAG ATC CTG TCC AAT CCG GCT ATT AAA CCC AGG GCC ATC AAC 1680
Tyr Arg Gln Ile Leu Ser Asn Pro Ala Ile Lys Pro Arg Ala Ile Asn
545 550 555 560

TGG CCA GCA TTA GAG AGC CAG TCT AAA GCA GTT AAT CAC CTA GAG GAG 1728
Trp Pro Ala Leu Glu Ser Gln Ser Lys Ala Val Asn His Leu Glu Glu
565 570 575

ACA TGC AGG GTC TAG 1743
Thr Cys Arg Val
580

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Met Ala Glu Gly Gly Phe Gly Ala Asp Ser Val Gly Arg Gly Gly Glu 
1 5 10 15

Lys Ala Ser Val Thr Arg Gly Gly Arg Trp Asp Leu Gly Ser Ser Asp
20 25 30

Asp Glu Ser Ser Thr Ser Thr Thr Ser Thr Asp Met Asp Asp Leu Pro
35 40 45

Glu Glu Arg Lys Pro Leu Thr Gly Lys Ser Val Lys Thr Ser Tyr Ile
50 55 60

Tyr Asp Val Pro Thr Val Pro Thr Ser Lys Pro Trp His Leu Met His
65 70 75 80

Asp Asn Ser Leu Tyr Ala Thr Pro Arg Phe Pro Pro Arg Pro Leu Ile
85 90 95

Arg His Pro Ser Glu Lys Gly Ser Ile Phe Ala Ser Arg Leu Ser Ala
100 105 110

Thr Asp Asp Asp Ser Gly Asp Tyr Ala Pro Met Asp Arg Phe Ala Phe
115 120 125

Gln Ser Pro Arg Val Cys Gly Arg Pro Pro Leu Pro Pro Pro Asn His
130 135 140

Pro Pro Pro Ala Thr Arg Pro Ala Asp Ala Ser Met Gly Asp Val Gly
145 150 155 160

Trp Ala Asp Leu Gln Gly Leu Lys Arg Thr Pro Lys Gly Phe Leu Lys
165 170 175

Thr Ser Thr Lys Gly Gly Ser Leu Lys Ala Arg Gly Arg Asp Val Gly
180 185 190

Asp Arg Leu Arg Asp Gly Gly Phe Ala Phe Ser Pro Arg Gly Val Lys
195 200 205

Ser Ala Ile Gly Gln Asn Ile Lys Ser Trp Leu Gly Ile Gly Glu Ser
210 215 220

Ser Ala Thr Ala Val Pro Val Thr Thr Gln Leu Met Val Pro Val His
225 230 235 240

Leu Ile Arg Thr Pro Val Thr Val Asp Tyr Arg Asn Val Tyr Leu Leu
245 250 255

Tyr Leu Glu Gly Val Met Gly Val Gly Lys Ser Thr Leu Val Asn Ala
260 265 270

Val Cys Gly Ile Leu Pro Gln Glu Arg Val Thr Ser Phe Pro Glu Pro
275 280 285

Met Val Tyr Trp Thr Arg Ala Phe Thr Asp Cys Tyr Lys Glu Ile Ser
290 295 300

His Leu Met Lys Ser Gly Lys Ala Gly Asp Pro Leu Thr Ser Ala Lys
305 310 315 320

Ile Tyr Ser Cys Gln Asn Lys Phe Ser Leu Pro Phe Arg Thr Asn Ala
325 330 335

Thr Ala Ile Leu Arg Met Met Gln Pro Trp Asn Val Gly Gly Gly Ser
340 345 350

Gly Arg Gly Thr His Trp Cys Val Phe Asp Arg His Leu Leu Ser Pro
355 360 365

Ala Val Val Phe Pro Leu Met His Leu Lys His Gly Arg Leu Ser Phe
370 375 380

Asp His Phe Phe Gln Leu Leu Ser Ile Phe Arg Ala Thr Glu Gly Asp
385 390 395 400

Val Val Ala Ile Leu Thr Leu Ser Ser Ala Glu Ser Leu Arg Arg Val
405 410 415

Arg Ala Arg Gly Arg Lys Asn Asp Gly Thr Val Glu Gln Asn Tyr Ile
420 425 430

Arg Glu Leu Ala Trp Ala Tyr His Ala Val Tyr Cys Ser Trp Ile Met
435 440 445

Leu Gln Tyr Ile Thr Val Glu Gln Met Val Gln Leu Cys Val Gln Thr
450 455 460

Thr Asn Ile Pro Glu Ile Cys Phe Arg Ser Val Arg Leu Ala His Lys
465 470 475 480

Glu Glu Thr Leu Lys Asn Leu His Glu Gln Ser Met Leu Pro Met Ile
485 490 495

Thr Gly Val Leu Asp Pro Val Arg His His Pro Val Val Ile Glu Leu
500 505 510

Cys Phe Cys Phe Phe Thr Glu Leu Arg Lys Leu Gln Phe Ile Val Ala
515 520 525

Asp Ala Asp Lys Phe His Asp Asp Val Cys Gly Leu Trp Thr Glu Ile
530 535 540

Tyr Arg Gln Ile Leu Ser Asn Pro Ala Ile Lys Pro Arg Ala Ile Asn
545 550 555 560

Trp Pro Ala Leu Glu Ser Gln Ser Lys Ala Val Asn His Leu Glu Glu
565 570 575

Thr Cys Arg Val
580

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2193 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2193
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

51? £?. S S S ~ ™ S S £= S 5 5 S S S 
S 22 S 51 S E S S S S S S S S S 

s s s is s js s ss s SI 5 e jjs s ~ 

35 4 



» --^ ^ «T«r- rTA AAT TGG C AA AAT 
MA GAA CTG GAA TTC £C GGC ACT M TTT ^ CxA ^ ^ ^ ^ 

He Glu Leu Glu Pne Asn biy gQ 



55 

50 



S S JS S? £ « S5 S E S S - - 5 S E 
« ffi E S SS K 55 SI SS S SS w - S SB - 

85 

~ w r-T"- TC^ GGA GAC GGC CAT CGC 

Si S JS £ - £S £ fa § & - •» - 8$ - - 

- s 1 S s s s i = - S 3 E s s s: 

«= TTT TAT •= C=T =T» CCO = " ^ " = 
Gly Phe Asn Tyr Ser Ala Leu Pro Gly His Leu Gly Gly Phe Gly Ile
130 135 140 

AAC GCG CGT CTG GTA CTG GGT GAT ATC TTC GCA TCA AAA TGG TCG CTA 480
Asn Ala Arg Leu Val Leu Gly Asp Ile Phe Ala Ser Lys Trp Ser Leu
145 150 155 160

TTC GCG AGG GAC ACC CCA GAG TAT CGG GTG TTT TAC CCA ATG AAT GTC 528
Phe Ala Arg Asp Thr Pro Glu Tyr Arg Val Phe Tyr Pro Met Asn Val
165 170 175

ATG GCC GTC AAG TTT TCC ATA TCC ATT GGC AAC AAC GAG TCC GGC GTA 576
Met Ala Val Lys Phe Ser Ile Ser Ile Gly Asn Asn Glu Ser Gly Val
180 185 190

GCG CTC TAT GGA GTG GTG TCG GAA GAT TTC GTG GTC GTC ACG CTC CAC 624
Ala Leu Tyr Gly Val Val Ser Glu Asp Phe Val Val Val Thr Leu His
195 200 205

AAC AGG TCC AAA GAG GCT AAC GAG ACG GCG TCC CAT CTT CTG TTC GGT 672
Asn Arg Ser Lys Glu Ala Asn Glu Thr Ala Ser His Leu Leu Phe Gly
210 215 220

CTC CCG GAT TCA CTG CCA TCT CTG AAG GGC CAT GCC ACC TAT GAT GAA 720
Leu Pro Asp Ser Leu Pro Ser Leu Lys Gly His Ala Thr Tyr Asp Glu
225 230 235 240

CTC ACG TTC GCC CGA AAC GCA AAA TAT GCG CTA GTG GCG ATC CTG CCT 768
Leu Thr Phe Ala Arg Asn Ala Lys Tyr Ala Leu Val Ala Ile Leu Pro
245 250 255

AAA GAT TCT TAC CAG ACA CTC CTT ACA GAG AAT TAC ACT CGC ATA TTT 816
Lys Asp Ser Tyr Gln Thr Leu Leu Thr Glu Asn Tyr Thr Arg Ile Phe
260 265 270

CTG AAC ATG ACG GAG TCG ACG CCC CTC GAG TTC ACG CGG ACG ATC CAG 864
Leu Asn Met Thr Glu Ser Thr Pro Leu Glu Phe Thr Arg Thr Ile Gln
275 280 285

ACC AGG ATC GTA TCA ATC GAG GCC AGG CGC GCC TGC GCA GCT CAA GAG 912
Thr Arg Ile Val Ser Ile Glu Ala Arg Arg Ala Cys Ala Ala Gln Glu
290 295 300

GCG GCG CCG GAC ATA TTC TTG GTG TTG TTT CAG ATG TTG GTG GCA CAC 960
Ala Ala Pro Asp Ile Phe Leu Val Leu Phe Gln Met Leu Val Ala His
305 310 315 320

TTT CTT GTT GCG CGG GGC ATT GCC GAG CAC CGA TTT GTG GAG GTG GAC 1008
Phe Leu Val Ala Arg Gly Ile Ala Glu His Arg Phe Val Glu Val Asp
325 330 335

TGC GTG TGT CGG CAG TAT GCG GAA CTG TAT TTT CTC CGC CGC ATC TCG 1056
Cys Val Cys Arg Gln Tyr Ala Glu Leu Tyr Phe Leu Arg Arg Ile Ser
340 345 350

CGT CTG TGC ATG CCC ACG TTC ACC ACT GTC GGG TAT AAC CAC ACC ACC 1104
Arg Leu Cys Met Pro Thr Phe Thr Thr Val Gly Tyr Asn His Thr Thr
355 360 365

CTT GGC GCT GTG GCC GCC ACA CAA ATA GCT CGC GTG TCC GCC ACG AAG 1152
Leu Gly Ala Val Ala Ala Thr Gln Ile Ala Arg Val Ser Ala Thr Lys
370 375 380

TTG GCC AGT TTG CCC CGC TCT TCC CAG GAA ACA GTG CTG GCC ATG GTC 1200
Leu Ala Ser Leu Pro Arg Ser Ser Gln Glu Thr Val Leu Ala Met Val
385 390 395 400

CAG CTT GGC GCC CGT GAT GGC GCC GTC CCT TCC TCC ATT CTG GAG GGC 1248
Gln Leu Gly Ala Arg Asp Gly Ala Val Pro Ser Ser Ile Leu Glu Gly
405 410 415

ATT GCT ATG GTC GTC GAA CAT ATG TAT ACC GCC TAC ACT TAT GTG TAC 1296
Ile Ala Met Val Val Glu His Met Tyr Thr Ala Tyr Thr Tyr Val Tyr
420 425 430

ACA CTC GGC GAT ACT GAA AGA AAA TTA ATG TTG GAC ATA CAC ACG GTC 1344
Thr Leu Gly Asp Thr Glu Arg Lys Leu Met Leu Asp Ile His Thr Val
435 440 445

CTC ACC GAC AGC TGC CCG CCC AAA GAC TCC GGA GTA TCA GAA AAG CTA 1392
Leu Thr Asp Ser Cys Pro Pro Lys Asp Ser Gly Val Ser Glu Lys Leu
450 455 460

CTG AGA ACA TAT TTG ATG TTC ACA TCA ATG TGT ACC AAC ATA GAG CTG 1440
Leu Arg Thr Tyr Leu Met Phe Thr Ser Met Cys Thr Asn Ile Glu Leu
465 470 475 480

GGC GAA ATG ATC GCC CGC TTT TCC AAA CCG GAC AGC CTT AAC ATC TAT 1488
Gly Glu Met Ile Ala Arg Phe Ser Lys Pro Asp Ser Leu Asn Ile Tyr
485 490 495

AGG GCA TTC TCC CCC TGC TTT CTA GGA CTA AGG TAC GAT TTG CAT CCA 1536
Arg Ala Phe Ser Pro Cys Phe Leu Gly Leu Arg Tyr Asp Leu His Pro
500 505 510

GCC AAG TTG CGC GCC GAG GCG CCG CAG TCG TCC GCT CTG ACG CGG ACT 1584
Ala Lys Leu Arg Ala Glu Ala Pro Gln Ser Ser Ala Leu Thr Arg Thr
515 520 525

GCC GTT GCC AGA GGA ACA TCG GGA TTC GCA GAA TTG CTC CAC GCG CTG 1632
Ala Val Ala Arg Gly Thr Ser Gly Phe Ala Glu Leu Leu His Ala Leu
530 535 540

CAC CTC GAT AGC TTA AAT TTA ATT CCG GCG ATT AAC TGT TCA AAG ATT 1680
His Leu Asp Ser Leu Asn Leu Ile Pro Ala Ile Asn Cys Ser Lys Ile
545 550 555 560

ACA GCC GAC AAG ATA ATA GCT ACG GTA CCC TTG CCT CAC GTC ACG TAT 1728
Thr Ala Asp Lys Ile Ile Ala Thr Val Pro Leu Pro His Val Thr Tyr
565 570 575

ATC ATC AGT TCC GAA GCA CTC TCG AAC GCT GTT GTC TAC GAG GTG TCG 1776
Ile Ile Ser Ser Glu Ala Leu Ser Asn Ala Val Val Tyr Glu Val Ser
580 585 590

GAG ATC TTC CTC AAG AGT GCC ATG TTT ATA TCT GCT ATC AAA CCC GAT 1824
Glu Ile Phe Leu Lys Ser Ala Met Phe Ile Ser Ala Ile Lys Pro Asp
595 600 605

TGC TCC GGC TTT AAC TTT TCT CAG ATT GAT AGG CAC ATT CCC ATA GTC 1872
Cys Ser Gly Phe Asn Phe Ser Gln Ile Asp Arg His Ile Pro Ile Val
610 615 620

TAC AAC ATC AGC ACA CCA AGA AGA GGT TGC CCC CTT TGT GAC TCT GTA 1920
Tyr Asn Ile Ser Thr Pro Arg Arg Gly Cys Pro Leu Cys Asp Ser Val
625 630 635 640

ATC ATG AGC TAC GAT GAG AGC GAT GGC CTG CAG TCT CTC ATG TAT GTC 1968
Ile Met Ser Tyr Asp Glu Ser Asp Gly Leu Gln Ser Leu Met Tyr Val
645 650 655

ACT AAT GAA AGG GTG CAG ACC AAC CTC TTT TTA GAT AAG TCA CCT TTC 2016
Thr Asn Glu Arg Val Gln Thr Asn Leu Phe Leu Asp Lys Ser Pro Phe
660 665 670
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Asn Arg Ser Lys Glu Ala Asn Glu Thr Ala Ser His Leu Leu Phe Gly 

210 215 220 

Leu Pro Asp Ser Leu Pro Ser Leu Lys Gly His Ala Thr Tyr Asp Glu
225 230 235 240

Leu Thr Phe Ala Arg Asn Ala Lys Tyr Ala Leu Val Ala Ile Leu Pro
245 250 255

Lys Asp Ser Tyr Gln Thr Leu Leu Thr Glu Asn Tyr Thr Arg Ile Phe
260 265 270

Leu Asn Met Thr Glu Ser Thr Pro Leu Glu Phe Thr Arg Thr Ile Gln
275 280 285

Thr Arg Ile Val Ser Ile Glu Ala Arg Arg Ala Cys Ala Ala Gln Glu
290 295 300

Ala Ala Pro Asp Ile Phe Leu Val Leu Phe Gln Met Leu Val Ala His
305 310 315 320

Phe Leu Val Ala Arg Gly Ile Ala Glu His Arg Phe Val Glu Val Asp
325 330 335

Cys Val Cys Arg Gln Tyr Ala Glu Leu Tyr Phe Leu Arg Arg Ile Ser
340 345 350

Arg Leu Cys Met Pro Thr Phe Thr Thr Val Gly Tyr Asn His Thr Thr
355 360 365

Leu Gly Ala Val Ala Ala Thr Gln Ile Ala Arg Val Ser Ala Thr Lys
370 375 380

Leu Ala Ser Leu Pro Arg Ser Ser Gln Glu Thr Val Leu Ala Met Val
385 390 395 400

Gln Leu Gly Ala Arg Asp Gly Ala Val Pro Ser Ser Ile Leu Glu Gly
405 410 415

Ile Ala Met Val Val Glu His Met Tyr Thr Ala Tyr Thr Tyr Val Tyr
420 425 430

Thr Leu Gly Asp Thr Glu Arg Lys Leu Met Leu Asp Ile His Thr Val
435 440 445

Leu Thr Asp Ser Cys Pro Pro Lys Asp Ser Gly Val Ser Glu Lys Leu
450 455 460

Leu Arg Thr Tyr Leu Met Phe Thr Ser Met Cys Thr Asn Ile Glu Leu
465 470 475 480

Gly Glu Met Ile Ala Arg Phe Ser Lys Pro Asp Ser Leu Asn Ile Tyr
485 490 495

Arg Ala Phe Ser Pro Cys Phe Leu Gly Leu Arg Tyr Asp Leu His Pro
500 505 510

Ala Lys Leu Arg Ala Glu Ala Pro Gln Ser Ser Ala Leu Thr Arg Thr
515 520 525

Ala Val Ala Arg Gly Thr Ser Gly Phe Ala Glu Leu Leu His Ala Leu
530 535 540

His Leu Asp Ser Leu Asn Leu Ile Pro Ala Ile Asn Cys Ser Lys Ile
545 550 555 560

Thr Ala Asp Lys Ile Ile Ala Thr Val Pro Leu Pro His Val Thr Tyr



WO 96/06159 



PCT/US95/10194



217 

Leu Phe Glu Lys Phe Gly Thr Pro Asp Ser Ser Thr Leu Pro Leu Tyr 

50 55 60

GCG GCT AGG CAC CCG GAA CTA TCG TTG CTA CGG ATC ATG CTC TCA CCG 240
Ala Ala Arg His Pro Glu Leu Ser Leu Leu Arg Ile Met Leu Ser Pro
65 70 75 80

CAC CCC TAC GCG TTA AGA AGC CAC TTG TGC GTA GGC GAA GAG ACC GCA 288
His Pro Tyr Ala Leu Arg Ser His Leu Cys Val Gly Glu Glu Thr Ala
85 90 95

TCT CTT GGC GTT TAC CTG CAC TCC AAG CCA GTC GTA CGC GGC CAC GAA 336
Ser Leu Gly Val Tyr Leu His Ser Lys Pro Val Val Arg Gly His Glu
100 105 110

TTC GAG GAC ACG CAG ATA CTA CCG GAG TGC CGG CTG GCC ATA ACG AGC 384
Phe Glu Asp Thr Gln Ile Leu Pro Glu Cys Arg Leu Ala Ile Thr Ser
115 120 125

GAC CAG TCT TAT ACC AAC TTT AAG ATT ATA GAT CTG CCA GCG GGA TGC 432
Asp Gln Ser Tyr Thr Asn Phe Lys Ile Ile Asp Leu Pro Ala Gly Cys
130 135 140

CGT CGC GTC CCC ATA CAC GCC GCG AAC AAG CGT GTC GTC ATC GAC GAG 480
Arg Arg Val Pro Ile His Ala Ala Asn Lys Arg Val Val Ile Asp Glu
145 150 155 160

GCC GCC AAC CGC ATA AAG GTG TTT GAC CCA GAG TCG CCT TTA CCG CGT 528
Ala Ala Asn Arg Ile Lys Val Phe Asp Pro Glu Ser Pro Leu Pro Arg
165 170 175

CAC CCC ATA ACA CCC CGT GCC GGT CAG ACC AGA TCT ATA CTG AAA CAC 576
His Pro Ile Thr Pro Arg Ala Gly Gln Thr Arg Ser Ile Leu Lys His
180 185 190

AAC ATC GCA CAG GTT TGC GAA CGG GAT ATC GTG TCA CTT AAC ACA GAC 624
Asn Ile Ala Gln Val Cys Glu Arg Asp Ile Val Ser Leu Asn Thr Asp
195 200 205

AAC GAG GCC GCG TCT ATG TTC TAC ATG ATT GGA CTC AGG CGG CCG AGA 672
Asn Glu Ala Ala Ser Met Phe Tyr Met Ile Gly Leu Arg Arg Pro Arg
210 215 220

CTC GGA GAA AGC CCG GTC TGT GAC TTC AAC ACC GTT ACC ATC ATG GAG 720
Leu Gly Glu Ser Pro Val Cys Asp Phe Asn Thr Val Thr Ile Met Glu
225 230 235 240

CGT GCT AAC AAC TCG ATA ACT TTT CTA CCC AAG CTA AAA CTG AAC CGG 768
Arg Ala Asn Asn Ser Ile Thr Phe Leu Pro Lys Leu Lys Leu Asn Arg
245 250 255

CTA CAA CAC CTG TTC CTG AAG CAC GTG TTG CTG CGC AGC ATG GGG CTG 816
Leu Gln His Leu Phe Leu Lys His Val Leu Leu Arg Ser Met Gly Leu
260 265 270

GAA AAC ATC GTG TCG TGT TTC TCA TCG CTG TAC GGC GCA GAA CTT GCC 864
Glu Asn Ile Val Ser Cys Phe Ser Ser Leu Tyr Gly Ala Glu Leu Ala
275 280 285

CCT GCG AAA ACA CAC GAG CGG GAG TTC TTC GGC GCT CTG CTA GAA AGA 912
Pro Ala Lys Thr His Glu Arg Glu Phe Phe Gly Ala Leu Leu Glu Arg
290 295 300

CTC AAA CGT CGG GTG GAG GAC GCG GTC TTC TGC CTG AAT ACC ATA GAG 960
Leu Lys Arg Arg Val Glu Asp Ala Val Phe Cys Leu Asn Thr Ile Glu
305 310 315 320

GAT TTC CCG TTT AGG GAA CCC ATT CGC CAA CCC CCA GAT TGT TCC AAG 1008
Asp Phe Pro Phe Arg Glu Pro Ile Arg Gln Pro Pro Asp Cys Ser Lys
325 330 335

GTG CTT ATA GAA GCC ATG GAA AAG TAC TTT ATG ATG TGT AGC CCC AAA 1056
Val Leu Ile Glu Ala Met Glu Lys Tyr Phe Met Met Cys Ser Pro Lys
340 345 350

GAC CGT CAA AGC GCC GCA TGG CTA GGT GCA GGG GTG GTC GAA CTG ATA 1104
Asp Arg Gln Ser Ala Ala Trp Leu Gly Ala Gly Val Val Glu Leu Ile
355 360 365

TGT GAC GGC AAT CCA CTT TCT GAG GTG CTC GGA TTT CTT GCC AAG TAT 1152
Cys Asp Gly Asn Pro Leu Ser Glu Val Leu Gly Phe Leu Ala Lys Tyr
370 375 380

ATG CCC ATA CAA AAA GAA TGC ACA GGA AAC CTT TTA AAA ATC TAC GCT 1200
Met Pro Ile Gln Lys Glu Cys Thr Gly Asn Leu Leu Lys Ile Tyr Ala
385 390 395 400

TTA TTG ACC GTC TAA 1215
Leu Leu Thr Val



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 404 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Met Leu Arg Val Pro Asp Val Lys Ala Ser Leu Val Glu Gly Ala Ala 
1 5 10 15

Arg Leu Ser Thr Gly Glu Arg Val Phe His Val Leu Thr Ser Pro Ala 
20 25 30 

Val Ala Ala Met Val Gly Val Ser Asn Pro Glu Val Pro Met Pro Leu 
35 40 45 

Leu Phe Glu Lys Phe Gly Thr Pro Asp Ser Ser Thr Leu Pro Leu Tyr 
50 55 60 

Ala Ala Arg His Pro Glu Leu Ser Leu Leu Arg Ile Met Leu Ser Pro
65 70 75 80

His Pro Tyr Ala Leu Arg Ser His Leu Cys Val Gly Glu Glu Thr Ala
85 90 95

Ser Leu Gly Val Tyr Leu His Ser Lys Pro Val Val Arg Gly His Glu
100 105 110

Phe Glu Asp Thr Gln Ile Leu Pro Glu Cys Arg Leu Ala Ile Thr Ser
115 120 125

Asp Gln Ser Tyr Thr Asn Phe Lys Ile Ile Asp Leu Pro Ala Gly Cys
130 135 140

Arg Arg Val Pro Ile His Ala Ala Asn Lys Arg Val Val Ile Asp Glu
145 150 155 160

Ala Ala Asn Arg Ile Lys Val Phe Asp Pro Glu Ser Pro Leu Pro Arg
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ATG GCA GCG CTC GAG GGC CCC CTA CTA CTG CCA CCG AGC GCC TCC CTG 48
Met Ala Ala Leu Glu Gly Pro Leu Leu Leu Pro Pro Ser Ala Ser Leu
1 5 10 15

ACG ACG AGT CCG CAG ACC ACG TGT TAT CAA GCG ACT TGG GAA TCA CAG 96
Thr Thr Ser Pro Gln Thr Thr Cys Tyr Gln Ala Thr Trp Glu Ser Gln
20 25 30

CTG GAA ATA TTC TGC TGT CTG GCC ACC AAC TCG CAC CTG CAG GCA GAG 144
Leu Glu Ile Phe Cys Cys Leu Ala Thr Asn Ser His Leu Gln Ala Glu
35 40 45

CTG ACC TTA GAA GGT CTT GAT AAG ATG ATG CAG CCC GAG CCC ACC TTT 192
Leu Thr Leu Glu Gly Leu Asp Lys Met Met Gln Pro Glu Pro Thr Phe
50 55 60

TTC GCC TGC AGA GCG ATA CGC AGA CTA CTC CTG GGG GAA CGC CTC CAC 240
Phe Ala Cys Arg Ala Ile Arg Arg Leu Leu Leu Gly Glu Arg Leu His
65 70 75 80

CCT TTT ATA CAT CAA GAA GGG ACT CTT TTG GGA AAA GTG GGT CGA CGG 288
Pro Phe Ile His Gln Glu Gly Thr Leu Leu Gly Lys Val Gly Arg Arg
85 90 95

TAC AGC GGC GAA GGT TTA ATA ATT GAC GGT GGT GGA GTG TTT ACG CGC 336
Tyr Ser Gly Glu Gly Leu Ile Ile Asp Gly Gly Gly Val Phe Thr Arg
100 105 110

GGA CAG ATA GAC ACC GAC AAC TAC CTA CCT GCG GTG GGA TCA TGG GAA 384
Gly Gln Ile Asp Thr Asp Asn Tyr Leu Pro Ala Val Gly Ser Trp Glu
115 120 125

CTT ACC GAT GAT TGT GAT AAA CCC TGC GAA TTC AGG GAG CTA CGC TCG 432
Leu Thr Asp Asp Cys Asp Lys Pro Cys Glu Phe Arg Glu Leu Arg Ser
130 135 140

CTG TAT CTT CCC GCG CTA CTA ACG TGC ACC ATA TGT TAC AAA GCC ATG 480
Leu Tyr Leu Pro Ala Leu Leu Thr Cys Thr Ile Cys Tyr Lys Ala Met
145 150 155 160

TTC AGG ATA GTG TGC AGG TAC CTG GAG TTC TGG GAG TTC GAA CAG TGT 528
Phe Arg Ile Val Cys Arg Tyr Leu Glu Phe Trp Glu Phe Glu Gln Cys
165 170 175

TTT CAT GCG TTT CTG GCG GTG TTG CCC CAT AGT CTA CAA CCC ACA ATC 576
Phe His Ala Phe Leu Ala Val Leu Pro His Ser Leu Gln Pro Thr Ile
180 185 190

TAT CAA AAT TAT TTT GCA CTC CTG GAG AGC CTG AAG CAT CTC TCG TTT 624
Tyr Gln Asn Tyr Phe Ala Leu Leu Glu Ser Leu Lys His Leu Ser Phe
195 200 205

TCA ATA ATG CCA CCC GCA TCC CCA GAC GCA CAG CTA CAT TTT TTA AAG 672
Ser Ile Met Pro Pro Ala Ser Pro Asp Ala Gln Leu His Phe Leu Lys
210 215 220

AAC ATC AGC AGC TTC ATG GCC ACG TGG GGG TGG CAC GGA GAG CTG 720
Phe Asn Ile Ser Ser Phe Met Ala Thr Trp Gly Trp His Gly Glu Leu
225 230 235 240

GTC TCG CTG CGC CGT GCC ATC GCT CAC AAC GTA GAG CGA CTG CCC ACC 768
Val Ser Leu Arg Arg Ala Ile Ala His Asn Val Glu Arg Leu Pro Thr
245 250 255

GTG CTG AAG AAC CTG TCG AAA CAG AGT AAG CAC CAG GAC GTC AAG GTT 816
Val Leu Lys Asn Leu Ser Lys Gln Ser Lys His Gln Asp Val Lys Val
260 265 270

AAC GGA CGG GAT CTG GTG GGC TTT CAG CTG GCT CTA AAC CAG CTC GTG 864
Asn Gly Arg Asp Leu Val Gly Phe Gln Leu Ala Leu Asn Gln Leu Val
275 280 285

TCC CGT CTG CAC GTA AAA ATC CAA CGC AAG GAC CCC GGA CCA AAG CCA 912
Ser Arg Leu His Val Lys Ile Gln Arg Lys Asp Pro Gly Pro Lys Pro
290 295 300

TAC AGG GTG GTC GTC AGT ACC CCA GAT TGT ACC TAC TAT CTA GTG TAT 960
Tyr Arg Val Val Val Ser Thr Pro Asp Cys Thr Tyr Tyr Leu Val Tyr
305 310 315 320

CCG GGC ACA CCG GCC ATC TAC AGA CTC GTC ATG TGT ATG GCA GTG GCA 1008
Pro Gly Thr Pro Ala Ile Tyr Arg Leu Val Met Cys Met Ala Val Ala
325 330 335

GAC TGC ATC GGC CAC TCG TGC AGC GGA CTG CAC CCC TGC GCA AAC TTT 1056
Asp Cys Ile Gly His Ser Cys Ser Gly Leu His Pro Cys Ala Asn Phe
340 345 350

TTA GGC ACC CAC GAG ACA CCG CGT CTC CTG GCG GCG ACG CTT TCA AGA 1104
Leu Gly Thr His Glu Thr Pro Arg Leu Leu Ala Ala Thr Leu Ser Arg
355 360 365

ATC CGG TAC GCG CCG AAA GAC CGG CGA GCA GCC ATG AAA GGA AAT TTG 1152
Ile Arg Tyr Ala Pro Lys Asp Arg Arg Ala Ala Met Lys Gly Asn Leu
370 375 380

CAG GCG TGC TTC CAA CGA TAC GCG GCC ACG GAC GCG CGG ACT CTG GGC 1200
Gln Ala Cys Phe Gln Arg Tyr Ala Ala Thr Asp Ala Arg Thr Leu Gly
385 390 395 400

AGC TCT ACA GTG TCA GAC ATG CTG GAA CCC ACA AAA CAC GTC AGT TTG 1248
Ser Ser Thr Val Ser Asp Met Leu Glu Pro Thr Lys His Val Ser Leu
405 410 415

GAA AAC TTC AAG ATC ACC ATA TTC AAC ACC AAC ATG GTG ATT AAC ACT 1296
Glu Asn Phe Lys Ile Thr Ile Phe Asn Thr Asn Met Val Ile Asn Thr
420 425 430

AAG ATA AGC TGC CAC GTT CCT AAC ACC CTG CAA AAG ACT ATT TTA AAC 1344
Lys Ile Ser Cys His Val Pro Asn Thr Leu Gln Lys Thr Ile Leu Asn
435 440 445

ATC CCC AGA TTG ACC AAC AAT TTT GTT ATA CGA AAG TAC TCC GTA AAG 1392
Ile Pro Arg Leu Thr Asn Asn Phe Val Ile Arg Lys Tyr Ser Val Lys
450 455 460

GAA CCT TCT TTT ACC ATA AGC GTG TTT TTT TCC GAC AAC ATG TGT CAA 1440
Glu Pro Ser Phe Thr Ile Ser Val Phe Phe Ser Asp Asn Met Cys Gln
465 470 475 480

GGC ACC GCA ATA AAC ATC AAC ATC AGT GGG GAC ATG CTG CAC TTT CTC 1488
Gly Thr Ala Ile Asn Ile Asn Ile Ser Gly Asp Met Leu His Phe Leu
485 490 495

TTC GCA ATG GGT ACG CTG AAA TGC TTT CTG CCA ATC AGG CAC ATA TTT 1536
Phe Ala Met Gly Thr Leu Lys Cys Phe Leu Pro Ile Arg His Ile Phe
500 505 510

CCT GTA TCG ATA GCA AAT TGG AAC TCC ACG TTG GAC CTG CAC GGA CTG 1584
Pro Val Ser Ile Ala Asn Trp Asn Ser Thr Leu Asp Leu His Gly Leu
515 520 525
GAA AAC CAG TAC ATG GTG AGA ATG GGG CGA AAA AAC GTA TTT TGG ACC 1632
Glu Asn Gln Tyr Met Val Arg Met Gly Arg Lys Asn Val Phe Trp Thr
530 535 540

ACA AAC TTT CCA TCT GTG GTC TCC AGC AAG GAT GGG CTA AAC GTG TCC 1680
Thr Asn Phe Pro Ser Val Val Ser Ser Lys Asp Gly Leu Asn Val Ser
545 550 555 560

TGG TT AAG GCC GCG ACA GCC ACG ATT TCT AAA GTG TAC GGG CAG CCT 1728
Trp Phe Lys Ala Ala Thr Ala Thr Ile Ser Lys Val Tyr Gly Gln Pro
565 570 575

CTT GTG GAA CAG ATT CGC CAC GAG CTG GCG CCC ATT CTC ACG GAC CAG 1776
Leu Val Glu Gln Ile Arg His Glu Leu Ala Pro Ile Leu Thr Asp Gln
580 585 590

CAC GCG CGC ATC GAC GGA AAC AAA AAT AGA ATA TTC TCC CTA CTT GAG 1824
His Ala Arg Ile Asp Gly Asn Lys Asn Arg Ile Phe Ser Leu Leu Glu
595 600 605

CAC AGA AAC CGT TCC CAA ATA CAG ACG CTA CAC AAA AGG TTC CTG GAG 1872
His Arg Asn Arg Ser Gln Ile Gln Thr Leu His Lys Arg Phe Leu Glu
610 615 620

TGT CTG GTG GAA TGC TGT TCG TTT CTC AGG CTT GAC GTG GCT TGC ATT 1920
Cys Leu Val Glu Cys Cys Ser Phe Leu Arg Leu Asp Val Ala Cys Ile
625 630 635 640

AGG CGA GCC GCC GCC CGG GGC CTG TTT GAC TTC TCA AAG AAG ATA ATC 1968
Arg Arg Ala Ala Ala Arg Gly Leu Phe Asp Phe Ser Lys Lys Ile Ile
645 650 655

AGT CAC ACT AAA AGC AAA CAC GAG TGC GCA GTA CTG GGA TAT AAA AAG 2016
Ser His Thr Lys Ser Lys His Glu Cys Ala Val Leu Gly Tyr Lys Lys
660 665 670

TGT AAC CTA ATC CCG AAA ATC TAT GCC CGA AAC AAG AAG ACC AGG CTA 2064
Cys Asn Leu Ile Pro Lys Ile Tyr Ala Arg Asn Lys Lys Thr Arg Leu
675 680 685

GAC GAG TTG GGC CGC AAT GCA AAC TTC ATT TCG TTC GTC GCC ACC ACG 2112
Asp Glu Leu Gly Arg Asn Ala Asn Phe Ile Ser Phe Val Ala Thr Thr
690 695 700

GGT CAT CGG TTC GCC GCT CTA AAG CCA CAA ATT GTC CGT CAC GCC ATT 2160
Gly His Arg Phe Ala Ala Leu Lys Pro Gln Ile Val Arg His Ala Ile
705 710 715 720

CGC AAA CTA GGC CTG CAC TGG CGC CAC CGA ACG GCC GCG TCC AAC GAG 2208
Arg Lys Leu Gly Leu His Trp Arg His Arg Thr Ala Ala Ser Asn Glu
725 730 735

CAG ACA CCG CCA GCC GAT CCC CGC GTA CGT TGC GTC CGT CCG CTG GTC 2256
Gln Thr Pro Pro Ala Asp Pro Arg Val Arg Cys Val Arg Pro Leu Val
740 745 750

TAA 2259



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Ala Ala Leu Glu Gly Pro Leu Leu Leu Pro Pro Ser Ala Ser Leu
1 5 10 15

T w Thr Ser Pro Gln Thr Thr Cys Tyr Gln Ala Thr Trp Glu Ser Gln
20 25 30

Leu Glu Ile Phe Cys Cys Leu Ala Thr Asn Ser His Leu Gln Ala Glu
35 40 45

Leu Thr Leu Glu Gly Leu Asp Lys Met Met Gln Pro Glu Pro Thr Phe
50 55 60

Phe Ala Cys Arg Ala Ile Arg Arg Leu Leu Leu Gly Glu Arg Leu His
65 70 75 80

Pro Phe Ile His Gln Glu Gly Thr Leu Leu Gly Lys Val Gly Arg Arg
85 90 95

Tyr Ser Gly Glu Gly Leu Ile Ile Asp Gly Gly Gly Val Phe Thr Arg
100 105 110

Gly Gln Ile Asp Thr Asp Asn Tyr Leu Pro Ala Val Gly Ser Trp Glu
115 120 125

Leu Thr Asp Asp Cys Asp Lys Pro Cys Glu Phe Arg Glu Leu Arg Ser
130 135 140

Leu Tyr Leu Pro Ala Leu Leu Thr Cys Thr Ile Cys Tyr Lys Ala Met
145 150 155 160

Phe Arg Ile Val Cys Arg Tyr Leu Glu Phe Trp Glu Phe Glu Gln Cys
165 170 175

Phe His Ala Phe Leu Ala Val Leu Pro His Ser Leu Gln Pro Thr Ile
180 185 190

Tyr Gln Asn Tyr Phe Ala Leu Leu Glu Ser Leu Lys His Leu Ser Phe
195 200 205

Ser Ile Met Pro Pro Ala Ser Pro Asp Ala Gln Leu His Phe Leu Lys
210 215 220

Phe Asn Ile Ser Ser Phe Met Ala Thr Trp Gly Trp His Gly Glu Leu
225 230 235 240

Val Ser Leu Arg Arg Ala Ile Ala His Asn Val Glu Arg Leu Pro Thr
245 250 255

Val Leu Lys Asn Leu Ser Lys Gln Ser Lys His Gln Asp Val Lys Val
260 265 270

As- Gly Arg Asp Leu Val Gly Phe Gln Leu Ala Leu Asn Gln Leu Val
275 280 285

Ser Arg Leu His Val Lys Ile Gln Arg Lys Asp Pro Gly Pro Lys Pro
290 295 300

Tyr Arg Val Val Val Ser Thr Pro Asp Cys Thr Tyr Tyr Leu Val Tyr
305 310 315 320

Pro Gly Thr Pro Ala Ile Tyr Arg Leu Val Met Cys Met Ala Val Ala
325 330 335

Gly His Arg Phe Ala Ala Leu Lys Pro Gln Ile Val Arg His Ala Ile
705 710 715 720

Arg Lys Leu Gly Leu His Trp Arg His Arg Thr Ala Ala Ser Asn Glu
725 730 735

Gln Thr Pro Pro Ala Asp Pro Arg Val Arg Cys Val Arg Pro Leu Val
740 745 750



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 364 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..364
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:



ATG GTA CGT CCA ACC GAG GCC GAG GTT AAG AAA TCC CTG AGC AGG CTT 48
Met Val Arg Pro Thr Glu Ala Glu Val Lys Lys Ser Leu Ser Arg Leu
1 5 10 15

CCA GCA GCA CGC AAA AGA GCA GGT AAC CGG GCC CAC CTG GCC ACC TAC 96
Pro Ala Ala Arg Lys Arg Ala Gly Asn Arg Ala His Leu Ala Thr Tyr
20 25 30

CGC CGG CTC CTC AAG TAC TCC ACC CTG CCC GAT CTA TGG CGG TTT CTA 144
Arg Arg Leu Leu Lys Tyr Ser Thr Leu Pro Asp Leu Trp Arg Phe Leu
35 40 45

AGT AGC CGG CCC CAG AAC CCT CCC CTT GGA CAC CAC AGA TTA TTC TTT 192
Ser Ser Arg Pro Gln Asn Pro Pro Leu Gly His His Arg Leu Phe Phe
50 55 60

GAG GTG ACT CTA GGG CAC AGA ATT GCC GAC TGC GTA ATT CTG GTA TCG 240
Glu Val Thr Leu Gly His Arg Ile Ala Asp Cys Val Ile Leu Val Ser
65 70 75 80

GGT GGG CAT CAG CCC GTA TGT TAC GTT GTA GAG CTC AAG ACT TGT CTG 288
Gly Gly His Gln Pro Val Cys Tyr Val Val Glu Leu Lys Thr Cys Leu
85 90 95

AGT CAC CAG CTG ATC CCA ACC AAC ACC GTG AGA ACG TCA CAG CGA GCT 336
Ser His Gln Leu Ile Pro Thr Asn Thr Val Arg Thr Ser Gln Arg Ala
100 105 110

CAA GGC CTG TGC CAA CTC TCC GAC TCG A 364
Gln Gly Leu Cys Gln Leu Ser Asp Ser
115 120

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Met Val Arg Pro Thr Glu Ala Glu Val Lys Lys Ser Leu Ser Arg Leu
1 5 10 15

Pro Ala Ala Arg Lys Arg Ala Gly Asn Arg Ala His Leu Ala Thr Tyr
20 25 30

Arg Arg Leu Leu Lys Tyr Ser Thr Leu Pro Asp Leu Trp Arg Phe Leu
35 40 45

Ser Ser Arg Pro Gln Asn Pro Pro Leu Gly His His Arg Leu Phe Phe
50 55 60

Glu Val Thr Leu Gly His Arg Ile Ala Asp Cys Val Ile Leu Val Ser
65 70 75 80

Gly Gly His Gln Pro Val Cys Tyr Val Val Glu Leu Lys Thr Cys Leu
85 90 95

Ser His Gln Leu Ile Pro Thr Asn Thr Val Arg Thr Ser Gln Arg Ala
100 105 110

Gln Gly Leu Cys Gln Leu Ser Asp Ser
115 120

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..918 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATG GCA CTC GAC AAG AGT ATA GTG GTT AAC TTC ACC TCC AGA CTC TTC 48
Met Ala Leu Asp Lys Ser Ile Val Val Asn Phe Thr Ser Arg Leu Phe
1 5 10 15

GCT GAT GAA CTG GCC GCC CTT CAG TCA AAA ATA GGG AGC GTA CTG CCG 96
Ala Asp Glu Leu Ala Ala Leu Gln Ser Lys Ile Gly Ser Val Leu Pro
20 25 30

CTC GGA GAT TGC CAC CGT TTA CAA AAT ATA CAG GCA TTG GGC CTG GGG 144
Leu Gly Asp Cys His Arg Leu Gln Asn Ile Gln Ala Leu Gly Leu Gly
35 40 45

TGC GTA TGC TCA CGT GAG ACA TCT CCG GAC TAC ATC CAA ATT ATG CAG 192
Cys Val Cys Ser Arg Glu Thr Ser Pro Asp Tyr Ile Gln Ile Met Gln
50 55 60

TAT CTA TCC AAG TGC ACA CTC GCT GTC CTG GAG GAG GTT CGC CCG GAC 240
Tyr Leu Ser Lys Cys Thr Leu Ala Val Leu Glu Glu Val Arg Pro Asp
65 70 75 80

AGC CTG CGC CTA ACG CGG ATG GAT CCC TCT GAC AAC CTT CAG ATA AAA 288
Ser Leu Arg Leu Thr Arg Met Asp Pro Ser Asp Asn Leu Gln Ile Lys
85 90 95

AAC GTA TAT GCC CCC TTT TTT CAG TGG GAC AGC AAC ACC CAG CTA GCA 336
Asn Val Tyr Ala Pro Phe Phe Gln Trp Asp Ser Asn Thr Gln Leu Ala
100 105 110

GTG CTA CCC CCA TTT TTT AGC CGA AAG GAT TCC ACC ATT GTG CTC GAA 384
Val Leu Pro Pro Phe Phe Ser Arg Lys Asp Ser Thr Ile Val Leu Glu
115 120 125

TCC AAC GGA TTT GAC CCC GTG TTC CCC ATG GTC GTG CCG CAG CAA CTG 432
Ser Asn Gly Phe Asp Pro Val Phe Pro Met Val Val Pro Gln Gln Leu
130 135 140

GGG CAC GCT ATT CTG CAG CAG CTG TTG GTG TAC CAC ATC TAC TCC AAA 480
Gly His Ala Ile Leu Gln Gln Leu Leu Val Tyr His Ile Tyr Ser Lys
145 150 155 160

ATA TCG GCC GGG GCC CCG GAT GAT GTA AAT ATG GCG GAA CTT GAT CTA 528
Ile Ser Ala Gly Ala Pro Asp Asp Val Asn Met Ala Glu Leu Asp Leu
165 170 175

TAT ACC ACC AAT GTG TCA TTT ATG GGG CGC ACA TAT CGT CTG GAC GTA 576
Tyr Thr Thr Asn Val Ser Phe Met Gly Arg Thr Tyr Arg Leu Asp Val
180 185 190

GAC AAC ACG GAT CCA CGT ACT GCC CTG CGA GTG CTT GAC GAT CTG TCC 624
Asp Asn Thr Asp Pro Arg Thr Ala Leu Arg Val Leu Asp Asp Leu Ser
195 200 205

ATG TAC CTT TGT ATC CTA TCA GCC TTG GTT CCC AGG GGG TGT CTC CGT 672
Met Tyr Leu Cys Ile Leu Ser Ala Leu Val Pro Arg Gly Cys Leu Arg
210 215 220

CTG CTC ACG GCG CTC GTG CGG CAC GAC AGG CAT CCT CTG ACA GAG GTG 720
Leu Leu Thr Ala Leu Val Arg His Asp Arg His Pro Leu Thr Glu Val
225 230 235 240

TTT GAG GGG GTG GTG CCA GAT GAG GTG ACC AGG ATA GAT CTC GAC CAG 768
Phe Glu Gly Val Val Pro Asp Glu Val Thr Arg Ile Asp Leu Asp Gln
245 250 255

TTG AGC GTC CCA GAT GAC ATC ACC AGG ATG CGC GTC ATG TTC TCC TAT 816
Leu Ser Val Pro Asp Asp Ile Thr Arg Met Arg Val Met Phe Ser Tyr
260 265 270

CTT CAG AGT CTC AGT TCT ATA TTT AAT CTT GGC CCC AGA CTG CAC GTG 864
Leu Gln Ser Leu Ser Ser Ile Phe Asn Leu Gly Pro Arg Leu His Val
275 280 285

TAT GCC TAC TCG GCA GAG ACT TTG GCG GCC TCC TGT TGG TAT TCC CCA 912
Tyr Ala Tyr Ser Ala Glu Thr Leu Ala Ala Ser Cys Trp Tyr Ser Pro
290 295 300

CGC TAA 918
Arg
305

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Ala Leu Asp Lys Ser Ile Val Val Asn Phe Thr Ser Arg Leu Phe
1 5 10 15

Ala Asp Glu Leu Ala Ala Leu Gln Ser Lys Ile Gly Ser Val Leu Pro
20 25 30

Leu Gly Asp Cys His Arg Leu Gln Asn Ile Gln Ala Leu Gly Leu Gly
35 40 45

Cys Val Cys Ser Arg Glu Thr Ser Pro Asp Tyr Ile Gln Ile Met Gln
50 55 60

Tyr Leu Ser Lys Cys Thr Leu Ala Val Leu Glu Glu Val Arg Pro Asp
65 70 75 80

Ser Leu Arg Leu Thr Arg Met Asp Pro Ser Asp Asn Leu Gln Ile Lys
85 90 95

Asn Val Tyr Ala Pro Phe Phe Gln Trp Asp Ser Asn Thr Gln Leu Ala
100 105 110

Val Leu Pro Pro Phe Phe Ser Arg Lys Asp Ser Thr Ile Val Leu Glu
115 120 125

Ser Asn Gly Phe Asp Pro Val Phe Pro Met Val Val Pro Gln Gln Leu
130 135 140

Gly His Ala Ile Leu Gln Gln Leu Leu Val Tyr His Ile Tyr Ser Lys
145 150 155 160

Ile Ser Ala Gly Ala Pro Asp Asp Val Asn Met Ala Glu Leu Asp Leu
165 170 175

Tyr Thr Thr Asn Val Ser Phe Met Gly Arg Thr Tyr Arg Leu Asp Val
180 185 190

Asp Asn Thr Asp Pro Arg Thr Ala Leu Arg Val Leu Asp Asp Leu Ser
195 200 205

Met Tyr Leu Cys Ile Leu Ser Ala Leu Val Pro Arg Gly Cys Leu Arg
210 215 220

Leu Leu Thr Ala Leu Val Arg His Asp Arg His Pro Leu Thr Glu Val
225 230 235 240

Phe Glu Gly Val Val Pro Asp Glu Val Thr Arg Ile Asp Leu Asp Gln
245 250 255

Leu Ser Val Pro Asp Asp Ile Thr Arg Met Arg Val Met Phe Ser Tyr
260 265 270

Leu Gln Ser Leu Ser Ser Ile Phe Asn Leu Gly Pro Arg Leu His Val
275 280 285

Tyr Ala Tyr Ser Ala Glu Thr Leu Ala Ala Ser Cys Trp Tyr Ser Pro
290 295 300

Arg
305

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 873 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..873
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATG GCG TCA TCT GAT ATT CTG TCG GTT GCA AGG ACG GAT GAC GGC TCC 48
Met Ala Ser Ser Asp Ile Leu Ser Val Ala Arg Thr Asp Asp Gly Ser
1 5 10 15

GTC TGT GAA GTC TCC CTG CGT GGA GGT AGG AAA AAA ACT ACC GTC TAC 96
Val Cys Glu Val Ser Leu Arg Gly Gly Arg Lys Lys Thr Thr Val Tyr
20 25 30

CTG CCG GAC ACT GAA CCC TGG GTG GTA GAG ACC GAC GCC ATC AAA GAC 144
Leu Pro Asp Thr Glu Pro Trp Val Val Glu Thr Asp Ala Ile Lys Asp
35 40 45

GCC TTC CTC AGC GAC GGG ATC GTG GAT ATG GCT CGA AAG CTT CAT CGT 192
Ala Phe Leu Ser Asp Gly Ile Val Asp Met Ala Arg Lys Leu His Arg
50 55 60

GGT GCC CTG CCC TCA AAT TCT CAC AAC GGC TTG AGG ATG GTG CTT TTT 240
Gly Ala Leu Pro Ser Asn Ser His Asn Gly Leu Arg Met Val Leu Phe
65 70 75 80

TG. TA. TGT TAC TTG CAA AAT TGT GTG TAC CTA GCC CTG TTT CTG TGC 288
Cys Tyr Cys Tyr Leu Gln Asn Cys Val Tyr Leu Ala Leu Phe Leu Cys
85 90 95

CC. CTT AA. CCT TAC TTG GTA ACT CCC TCA AGC ATT GAG TTT GCC GAG 336
Pro Leu Asn Pro Tyr Leu Val Thr Pro Ser Ser Ile Glu Phe Ala Glu
100 105 110

... GTT GTG GCA CCT GAG GTG CTC TTC CCA CAC CCG GCT GAG ATG TCT 384
Pro Val Val Ala Pro Glu Val Leu Phe Pro His Pro Ala Glu Met Ser
115 120 125

ATA ATC AAC ACC ACG TTT GGA CGC ATT TAC CCG AAC TCT ACA CGC GAG 480
Ile Ile Asn Thr Thr Phe Gly Arg Ile Tyr Pro Asn Ser Thr Arg Glu
145 150 155 160

CCG GAC GGC AGG CCT ACG GAT TAC TCC ATG GCC CTT AGA AGG GCT TTT 528
Pro Asp Gly Arg Pro Thr Asp Tyr Ser Met Ala Leu Arg Arg Ala Phe
165 170 175

GCA GTT ATG GTT AAC ACG TCA TGT GCA GGA GTG ACA TTG TGC CGC GGA 576
Ala Val Met Val Asn Thr Ser Cys Ala Gly Val Thr Leu Cys Arg Gly
180 185 190

GAA ACT CAG ACC GCA TCC CGT AAC CAC ACT GAG TGG GAA AAT CTG CTG 624
Glu Thr Gln Thr Ala Ser Arg Asn His Thr Glu Trp Glu Asn Leu Leu
195 200 205

GCT ATG TTT TCT GTG ATT ATC TAT GCC TTA GAT CAC AAC TGT CAC CCG 672
Ala Met Phe Ser Val Ile Ile Tyr Ala Leu Asp His Asn Cys His Pro
210 215 220

GAA GCA CTG TCT ATC GCG AGC GGC ATC TTT GAC GAG CGT GAC TAT GGA 720
Glu Ala Leu Ser Ile Ala Ser Gly Ile Phe Asp Glu Arg Asp Tyr Gly
225 230 235 240

TTA TTC ATC TCT CAG CCC CGG AGC GTG CCC TCG CCT ACC CCT TGC GAC 768
Leu Phe Ile Ser Gln Pro Arg Ser Val Pro Ser Pro Thr Pro Cys Asp
245 250 255

GTG TCG TGG GAA GAT ATC TAC AAC GGG ACT TAC CTA GCT CGG CCT GGA 816
Val Ser Trp Glu Asp Ile Tyr Asn Gly Thr Tyr Leu Ala Arg Pro Gly
260 265 270

AAC TGT GAC CCC TGG CCC AAT CTA TCC ACC CCT CCC TTG ATT CTA AAT 864
Asn Cys Asp Pro Trp Pro Asn Leu Ser Thr Pro Pro Leu Ile Leu Asn
275 280 285

TTT AAA TAA 873
Phe Lys
290

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 290 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Ala Ser Ser Asp Ile Leu Ser Val Ala Arg Thr Asp Asp Gly Ser
1 5 10 15

Val Cys Glu Val Ser Leu Arg Gly Gly Arg Lys Lys Thr Thr Val Tyr
20 25 30

Leu Pro Asp Thr Glu Pro Trp Val Val Glu Thr Asp Ala Ile Lys Asp
35 40 45

Ala Phe Leu Ser Asp Gly Ile Val Asp Met Ala Arg Lys Leu His Arg
50 55 60

Gly Ala Leu Pro Ser Asn Ser His Asn Gly Leu Arg Met Val Leu Phe
65 70 75 80

Cys Tyr Cys Tyr Leu Gln Asn Cys Val Tyr Leu Ala Leu Phe Leu Cys
85 90 95

Pro Leu Asn Pro Tyr Leu Val Thr Pro Ser Ser Ile Glu Phe Ala Glu
100 105 110

Pro Val Val Ala Pro Glu Val Leu Phe Pro His Pro Ala Glu Met Ser
115 120 125

Arg Gly Cys Asp Asp Ala Ile Phe Cys Lys Leu Pro Tyr Thr Val Pro
130 135 140

Ile Ile Asn Thr Thr Phe Gly Arg Ile Tyr Pro Asn Ser Thr Arg Glu
145 150 155 160

Pro Asp Gly Arg Pro Thr Asp Tyr Ser Met Ala Leu Arg Arg Ala Phe
165 170 175

Ala Val Met Val Asn Thr Ser Cys Ala Gly Val Thr Leu Cys Arg Gly
180 185 190

Glu Thr Gln Thr Ala Ser Arg Asn His Thr Glu Trp Glu Asn Leu Leu
195 200 205

Ala Met Phe Ser Val Ile Ile Tyr Ala Leu Asp His Asn Cys His Pro
210 215 220

Glu Ala Leu Ser Ile Ala Ser Gly Ile Phe Asp Glu Arg Asp Tyr Gly
225 230 235 240

Leu Phe Ile Ser Gln Pro Arg Ser Val Pro Ser Pro Thr Pro Cys Asp
245 250 255

Val Ser Trp Glu Asp Ile Tyr Asn Gly Thr Tyr Leu Ala Arg Pro Gly
260 265 270

Asn Cys Asp Pro Trp Pro Asn Leu Ser Thr Pro Pro Leu Ile Leu Asn
275 280 285

Phe Lys
290



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 363 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..363
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

ATG AGC ATG ACT TTC CCC GTC TCC AGT CAC CGG AGG AAT GGT GGA CGG 48
Met Ser Met Thr Phe Pro Val Ser Ser His Arg Arg Asn Gly Gly Arg
1 5 10 15

CT. CGT CCT GGT GCG AAT GGC CAC CAA GCC TCC CGT GAT TGG TCT TAT 96
Leu Arg Pro Gly Ala Asn Gly His Gln Ala Ser Arg Asp Trp Ser Tyr
20 25 30

AAC AGT GCT CTT CCT CCT AGT CAT AGG CGC CTG CGT CTA CTG CTG CAT 144
Asn Ser Ala Leu Pro Pro Ser His Arg Arg Leu Arg Leu Leu Leu His
35 40 45

TCG CG. GTT CCT GGC GGC TCG ACT GTG GCG CGC CAC CCC ACT AGG CAG 192
Ser Arg Val Pro Gly Gly Ser Thr Val Ala Arg His Pro Thr Arg Gln
50 55 60

GGC CA. CG. GGC GTA TCA GGT CCT TCG CAC CCT GGG ACC GCA GGC CGG 240
Gly His Arg Gly Val Ser Gly Pro Ser His Pro Gly Thr Ala Gly Arg
65 70 75 80

GTC ACA TGC ACC GCC GAC GGT GGG CAT AGC TAC CCA GGA GCC CTA CCG 288
Val Thr Cys Thr Ala Asp Gly Gly His Ser Tyr Pro Gly Ala Leu Pro
85 90 95

TA. AAT ATA CAT GCC AGA TTA GAA CGG GGT GTG TGC TAT AAT GGA TGG 336
Tyr Asn Ile His Ala Arg Leu Glu Arg Gly Val Cys Tyr Asn Gly Trp
100 105 110

CTA TGG GGG GGG GCT GTA GAT AAT TGA 363
Leu Trp Gly Gly Ala Val Asp Asn
115 120

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Ser Met Thr Phe Pro Val Ser Ser His Arg Arg Asn Gly Gly Arg
1 5 10 15

Leu Arg Pro Gly Ala Asn Gly His Gln Ala Ser Arg Asp Trp Ser Tyr
20 25 30

Asn Ser Ala Leu Pro Pro Ser His Arg Arg Leu Arg Leu Leu Leu His
35 40 45

Ser Arg Val Pro Gly Gly Ser Thr Val Ala Arg His Pro Thr Arg Gln
50 55 60

Gly His Arg Gly Val Ser Gly Pro Ser His Pro Gly Thr Ala Gly Arg
65 70 75 80

Val Thr Cys Thr Ala Asp Gly Gly His Ser Tyr Pro Gly Ala Leu Pro
85 90 95

Tyr Asn Ile His Ala Arg Leu Glu Arg Gly Val Cys Tyr Asn Gly Trp
100 105 110

Leu Trp Gly Gly Ala Val Asp Asn
115 120

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 921 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: N

(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..921
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

ATG CTG CTC AGC CGT CAC AGG GAG CGC CTT GCC GCC AAC CTG GAG GAG 48
Met Leu Leu Ser Arg His Arg Glu Arg Leu Ala Ala Asn Leu Glu Glu
1 5 10 15

ACC GCC AAA GAC GCC GGA GAG AGG TGG GAA CTG AGT GCC CCG ACA TTC 96
Thr Ala Lys Asp Ala Gly Glu Arg Trp Glu Leu Ser Ala Pro Thr Phe
20 25 30

ACG CGA CAC TGT CCC AAA ACG GCA CGG ATG GCG CAC CCT TTT ATT GGC 144
Thr Arg His Cys Pro Lys Thr Ala Arg Met Ala His Pro Phe Ile Gly
35 40 45

GTG GTG CAC AGA ATA AAC TCA TAC AGT TCG GTC CTG GAA ACA TAC TGC 192
Val Val His Arg Ile Asn Ser Tyr Ser Ser Val Leu Glu Thr Tyr Cys
50 55 60

ACA CGG CAC CAT CCC GCC ACG CCC ACG TCA GCA AAT CCC GAC GTG GGA 240
Thr Arg His His Pro Ala Thr Pro Thr Ser Ala Asn Pro Asp Val Gly
65 70 75 80

ACC CCC AGA CCG TCC GAG GAC AAC GTC CCC GCA AAG CCG CGC CTA TTG 288
Thr Pro Arg Pro Ser Glu Asp Asn Val Pro Ala Lys Pro Arg Leu Leu
85 90 95

GAG TCC CTA TCA ACA TAC TTG CAG ATG CGG TGT GTG CGC GAG GAC GCG 336
Glu Ser Leu Ser Thr Tyr Leu Gln Met Arg Cys Val Arg Glu Asp Ala
100 105 110

CAC GTC TCC ACG GCC GAT CAA CTG GTC GAG TAC CAG GCG GGC AGA AAA 384
His Val Ser Thr Ala Asp Gln Leu Val Glu Tyr Gln Ala Gly Arg Lys
115 120 125

ACA CAC GAC TCC CTG CAC GCC TGC TCT GTC TAC CGC GAA CTT CAG GCT 432
Thr His Asp Ser Leu His Ala Cys Ser Val Tyr Arg Glu Leu Gln Ala
130 135 140

TTT CTG GTT AAC CTT TCG TCC TTT CTG AAC GGC TGT TAC GTT CCC GGG 480
Phe Leu Val Asn Leu Ser Ser Phe Leu Asn Gly Cys Tyr Val Pro Gly
145 150 155 160

GTC CAC TGG CTG GAG CCC TTC CAA CAG CAG CTA GTA ATG CAC ACT TTT 528
Val His Trp Leu Glu Pro Phe Gln Gln Gln Leu Val Met His Thr Phe
165 170 175

... ... ... GTT TCA ATC AAG GCC CCA CAA AAG ACG CAC CAG TTG TTT 576
Phe Phe Leu Val Ser Ile Lys Ala Pro Gln Lys Thr His Gln Leu Phe
180 185 190

GGA TTG TTT AAG CAG TAC TTC GGT TTA TTT GAA ACT CGA AAC AGT GTT 624
Gly Leu Phe Lys Gln Tyr Phe Gly Leu Phe Glu Thr Pro Asn Ser Val
195 200 205

TTA CAG ACG TTT AAG CAA AAG GCA AGC GTA TTC CTA ATA CCA AGG AGA 672
Leu Gln Thr Phe Lys Gln Lys Ala Ser Val Phe Leu Ile Pro Arg Arg
210 215 220

.A. GGA AAG ACA TGG ATA GTG GTG GCG ATC ATC AGC ATG CTA CTG GCA 720
His Gly Lys Thr Trp Ile Val Val Ala Ile Ile Ser Met Leu Leu Ala
225 230 235 240

T.. GTA GAG AAC ATT AAC ATT GGG TAC GTA GCC CAC CAA AAG CAC GTA 768
Ser Val Glu Asn Ile Asn Ile Gly Tyr Val Ala His Gln Lys His Val
245 250 255

G.. AA. TCC GTG TTC GCG GAA ATC ATA AAG ACG CTT TGT CGG TGG TTC 816
Ala Asn Ser Val Phe Ala Glu Ile Ile Lys Thr Leu Cys Arg Trp Phe
260 265 270

CCC CCC AAA AAT TTA AAC ATC AAG AAG GAG AAC GGA ACC ATA ATC TAC 864
Pro Pro Lys Asn Leu Asn Ile Lys Lys Glu Asn Gly Thr Ile Ile Tyr
275 280 285

ACG CGA CCC GGA GGA CGG TCC AGC TCG CTG ATG TGC GCA ACA TGC TTC 912
Thr Arg Pro Gly Gly Arg Ser Ser Ser Leu Met Cys Ala Thr Cys Phe
290 295 300

AAT AAG AAC 921
Asn Lys Asn
305

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Met Leu Leu Ser Arg His Arg Glu Arg Leu Ala Ala Asn Leu Glu Glu
1 5 10 15

Thr Ala Lys Asp Ala Gly Glu Arg Trp Glu Leu Ser Ala Pro Thr Phe
20 25 30

Thr Arg His Cys Pro Lys Thr Ala Arg Met Ala His Pro Phe Ile Gly
35 40 45

Val Val His Arg Ile Asn Ser Tyr Ser Ser Val Leu Glu Thr Tyr Cys
50 55 60

Thr Arg His His Pro Ala Thr Pro Thr Ser Ala Asn Pro Asp Val Gly
65 70 75 80

Thr Pro Arg Pro Ser Glu Asp Asn Val Pro Ala Lys Pro Arg Leu Leu
85 90 95

Glu Ser Leu Ser Thr Tyr Leu Gln Met Arg Cys Val Arg Glu Asp Ala
100 105 110

His Val Ser Thr Ala Asp Gln Leu Val Glu Tyr Gln Ala Gly Arg Lys
115 120 125

Thr His Asp Ser Leu His Ala Cys Ser Val Tyr Arg Glu Leu Gln Ala
130 135 140

Phe Leu Val Asn Leu Ser Ser Phe Leu Asn Gly Cys Tyr Val Pro Gly
145 150 155 160

Val His Trp Leu Glu Pro Phe Gln Gln Gln Leu Val Met His Thr Phe
165 170 175

Phe Phe Leu Val Ser Ile Lys Ala Pro Gln Lys Thr His Gln Leu Phe
180 185 190

Gly Leu Phe Lys Gln Tyr Phe Gly Leu Phe Glu Thr Pro Asn Ser Val
195 200 205

Leu Gln Thr Phe Lys Gln Lys Ala Ser Val Phe Leu Ile Pro Arg Arg
210 215 220

His Gly Lys Thr Trp Ile Val Val Ala Ile Ile Ser Met Leu Leu Ala
225 230 235 240

Ser Val Glu Asn Ile Asn Ile Gly Tyr Val Ala His Gln Lys His Val
245 250 255

Ala Asn Ser Val Phe Ala Glu Ile Ile Lys Thr Leu Cys Arg Trp Phe
260 265 270

Pro Pro Lys Asn Leu Asn Ile Lys Lys Glu Asn Gly Thr Ile Ile Tyr
275 280 285

Thr Arg Pro Gly Gly Arg Ser Ser Ser Leu Met Cys Ala Thr Cys Phe
290 295 300

Asn Lys Asn
305

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1365 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N

(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1365
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:



ATG GA. GCG CAT GCT ATC AAC GAA AGA TAC GTA GGT CCT CGC TGC CAC 48
Met Asp Ala His Ala Ile Asn Glu Arg Tyr Val Gly Pro Arg Cys His
1 5 10 15

CG. TTG GCC CAC GTG GTG CTG CCT AGG ACC TTT CTG CTG CAT CAC GCC 96
Arg Leu Ala His Val Val Leu Pro Arg Thr Phe Leu Leu His His Ala
20 25 30

ATA CCC CTG GAG CCC GAG ATC ATC TTT TCC ACC TAC ACC CGG TTC AGC 144
Ile Pro Leu Glu Pro Glu Ile Ile Phe Ser Thr Tyr Thr Arg Phe Ser
35 40 45

CGG TCG CCA GGG TCA TCC CGC CGG TTG GTG GTG TGT GGG AAA CGT GTC 192
Arg Ser Pro Gly Ser Ser Arg Arg Leu Val Val Cys Gly Lys Arg Val
50 55 60

CTG CCA GGG GAG GAA AAC CAA CTT GCG TCT TCA ... ... ... TTG GCG 240
Leu Pro Gly Glu Glu Asn Gln Leu Ala Ser Ser Pro Ser Gly Leu Ala
65 70 75 80

... AGC CTG CCT CTG ... TCC CAC GAT GGG AAC TTT CAT CCA ... GAC 288
Leu Ser Leu Pro Leu Phe Ser His Asp Gly Asn Phe His Pro Phe Asp
85 90 95

ATC TCG GTA CTG CGC ATT TCC TGC CCT GGT TCT AAT CTT AGT CTT ACT 336
Ile Ser Val Leu Arg Ile Ser Cys Pro Gly Ser Asn Leu Ser Leu Thr
100 105 110

GTC AGA ... ... TAT CTA TCT CTG GTG GTG GCT ATG GGG GCG GGA CGG 384
Val Arg Phe Leu Tyr Leu Ser Leu Val Val Ala Met Gly Ala Gly Arg
115 120 125

AAT AAT GCG CGG AGT CCG ACC GTT GAC GGG GTA TCG CCG CCA GAG GGC 432
Asn Asn Ala Arg Ser Pro Thr Val Asp Gly Val Ser Pro Pro Glu Gly
130 135 140

GCC GTA GCC CAC CCT TTG GAG GAA CTG CAG AGG CTG GCG CGT GCT ACG 480
Ala Val Ala His Pro Leu Glu Glu Leu Gln Arg Leu Ala Arg Ala Thr
145 150 155 160

CCG GAC CCG GCA CTC ACC CGT GGA CCG TTG CAG GTC CTG ACC GGC CTT 528
Pro Asp Pro Ala Leu Thr Arg Gly Pro Leu Gln Val Leu Thr Gly Leu
165 170 175

CTC CGC GCA GGG TCA GAC GGA GAC CGC GCC ACT CAC CAC ATG GCG CTC 576
Leu Arg Ala Gly Ser Asp Gly Asp Arg Ala Thr His His Met Ala Leu
180 185 190

GAG GCT CCG GGA ACC GTG CGT GGA GAA AGC CTA GAC CCG CCT GTT TCA 624
Glu Ala Pro Gly Thr Val Arg Gly Glu Ser Leu Asp Pro Pro Val Ser
195 200 205

CAG AAG GGG CCA GCG CGC ACA CGC CAC AGG CCA CCC CCC GTG CGA CTG 672
Gln Lys Gly Pro Ala Arg Thr Arg His Arg Pro Pro Pro Val Arg Leu
210 215 220

AGC TTC AAC CCC GTC AAT GCC GAT GTA CCC GCT ACC TGG CGA GAC GCC 720
Ser Phe Asn Pro Val Asn Ala Asp Val Pro Ala Thr Trp Arg Asp Ala
225 230 235 240

ACT AAC GTG TAC TCG GGT GCT CCC TAC TAT GTG TGT GTT TAC GAA CGC 768
Thr Asn Val Tyr Ser Gly Ala Pro Tyr Tyr Val Cys Val Tyr Glu Arg
245 250 255

GGT GGC CGT CAG GAA GAC GAC TGG CTG CCG ATA CCA CTG AGC ... CCA 816
Gly Gly Arg Gln Glu Asp Asp Trp Leu Pro Ile Pro Leu Ser Phe Pro
260 265 270

GAA GAG ... ... ... CCG CCA CCG GGC TTA GTG TTC ATG GAC GAC TTG 864
Glu Glu Pro Val Pro Pro Pro Pro Gly Leu Val Phe Met Asp Asp Leu
275 280 285

TTC ATT AAC ACG AAG CAG TGC GAC TTT GTG GAC ACG CTA GAG GCC GCC 912
Phe Ile Asn Thr Lys Gln Cys Asp Phe Val Asp Thr Leu Glu Ala Ala
290 295 300

TGT CGC ACG CAA GGC TAG AGG TTG AGA CAG CGC GTG CCT GTC GCC ATT 960
Cys Arg Thr Gln Gly Tyr Thr Leu Arg Gln Arg Val Pro Val Ala Ile
305 310 315 320

CCT CGC GAC GCG GAA ATC GCA GAC GCA GTT AAA TCG CAC TTT TTA GAG 1008
Pro Arg Asp Ala Glu Ile Ala Asp Ala Val Lys Ser His Phe Leu Glu
325 330 335

GCG TGC CTA GTG TTA CGG GGG CTG GCT TCG GAG GCT AGT GCC TGG ATA 1056
Ala Cys Leu Val Leu Arg Gly Leu Ala Ser Glu Ala Ser Ala Trp Ile
340 345 350

AGA GCT GCC ACG TCC CCG CCC CTT GGC CGC CAC GCC TGC TGG ATG GAC 1104
Arg Ala Ala Thr Ser Pro Pro Leu Gly Arg His Ala Cys Trp Met Asp
355 360 365

GTG TTA GGA TTA TGG GAA AGC CGC CCC CAC ACT CTA GGT TTG GAG TTA 1152
Val Leu Gly Leu Trp Glu Ser Arg Pro His Thr Leu Gly Leu Glu Leu
370 375 380

CGC GGC GTA AAC TGT GGC GGC ACG GAC GGT GAC TGG TTA GAG ATT TTA 1200
Arg Gly Val Asn Cys Gly Gly Thr Asp Gly Asp Trp Leu Glu Ile Leu
385 390 395 400

AAA CAG CCC GAT GTG CAA AAG ACA GTC AGC GGG AGT CTT GTG GCA TGC 1248
Lys Gln Pro Asp Val Gln Lys Thr Val Ser Gly Ser Leu Val Ala Cys
405 410 415

GTG ATC GTC ACA CCC GCA TTG GAA GCC TGG CTT GTG TTA CCT GGG GGT 1296
Val Ile Val Thr Pro Ala Leu Glu Ala Trp Leu Val Leu Pro Gly Gly
420 425 430

TTT GCT ATT AAA GCC CGC TAT AGG GCG TCG AAG GAG GAT CTG GTG TTC 1344
Phe Ala Ile Lys Ala Arg Tyr Arg Ala Ser Lys Glu Asp Leu Val Phe
435 440 445

ATT CGA GGC CGC TAT GGC TAG 1365
Ile Arg Gly Arg Tyr Gly
450

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Met Asp Ala His Ala Ile Asn Glu Arg Tyr Val Gly Pro Arg Cys His
1 5 10 15

Arg Leu Ala His Val Val Leu Pro Arg Thr Phe Leu Leu His His Ala
20 25 30

Ile Pro Leu Glu Pro Glu Ile Ile Phe Ser Thr Tyr Thr Arg Phe Ser
35 40 45

Arg Ser Pro Gly Ser Ser Arg Arg Leu Val Val Cys Gly Lys Arg Val
50 55 60

Leu Pro Gly Glu Glu Asn Gln Leu Ala Ser Ser Pro Ser Gly Leu Ala
65 70 75 80

Leu Ser Leu Pro Leu Phe Ser His Asp Gly Asn Phe His Pro Phe Asp
85 90 95

Ile Ser Val Leu Arg Ile Ser Cys Pro Gly Ser Asn Leu Ser Leu Thr
100 105 110

Val Arg Phe Leu Tyr Leu Ser Leu Val Val Ala Met Gly Ala Gly Arg
115 120 125

Asn Asn Ala Arg Ser Pro Thr Val Asp Gly Val Ser Pro Pro Glu Gly
130 135 140

Ala Val Ala His Pro Leu Glu Glu Leu Gln Arg Leu Ala Arg Ala Thr
145 150 155 160

Pro Asp Pro Ala Leu Thr Arg Gly Pro Leu Gln Val Leu Thr Gly Leu
165 170 175

Leu Arg Ala Gly Ser Asp Gly Asp Arg Ala Thr His His Met Ala Leu
180 185 190

Glu Ala Pro Gly Thr Val Arg Gly Glu Ser Leu Asp Pro Pro Val Ser
195 200 205

Gln Lys Gly Pro Ala Arg Thr Arg His Arg Pro Pro Pro Val Arg Leu
210 215 220

Ser Phe Asn Pro Val Asn Ala Asp Val Pro Ala Thr Trp Arg Asp Ala
225 230 235 240

Thr Asn Val Tyr Ser Gly Ala Pro Tyr Tyr Val Cys Val Tyr Glu Arg
245 250 255

Gly Gly Arg Gln Glu Asp Asp Trp Leu Pro Ile Pro Leu Ser Phe Pro
260 265 270

Glu Glu Pro Val Pro Pro Pro Pro Gly Leu Val Phe Met Asp Asp Leu
275 280 285

Phe Ile Asn Thr Lys Gln Cys Asp Phe Val Asp Thr Leu Glu Ala Ala
290 295 300

Cys Arg Thr Gln Gly Tyr Thr Leu Arg Gln Arg Val Pro Val Ala Ile
305 310 315 320

Pro Arg Asp Ala Glu Ile Ala Asp Ala Val Lys Ser His Phe Leu Glu
325 330 335

Ala Cys Leu Val Leu Arg Gly Leu Ala Ser Glu Ala Ser Ala Trp Ile
340 345 350

Arg Ala Ala Thr Ser Pro Pro Leu Gly Arg His Ala Cys Trp Met Asp
355 360 365

Val Leu Gly Leu Trp Glu Ser Arg Pro His Thr Leu Gly Leu Glu Leu
370 375 380

Arg Gly Val Asn Cys Gly Gly Thr Asp Gly Asp Trp Leu Glu Ile Leu
385 390 395 400

Lys Gln Pro Asp Val Gln Lys Thr Val Ser Gly Ser Leu Val Ala Cys
405 410 415

Val Ile Val Thr Pro Ala Leu Glu Ala Trp Leu Val Leu Pro Gly Gly
420 425 430

Phe Ala Ile Lys Ala Arg Tyr Arg Ala Ser Lys Glu Asp Leu Val Phe
435 440 445

Ile Arg Gly Arg Tyr Gly
450

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: N
(iv) ANTI-SENSE: N

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..984
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG TTT GCT TTG AGC TCG CTC GTG TCC GAG GGT GAC CCG GAG GTG ACC 48
Met Phe Ala Leu Ser Ser Leu Val Ser Glu Gly Asp Pro Glu Val Thr
1 5 10 15

AGT AGG TAC GTC AAG GGC GTA CAA CTT GCC CTG GAC CTT AGC GAG AAC 96
Ser Arg Tyr Val Lys Gly Val Gln Leu Ala Leu Asp Leu Ser Glu Asn
20 25 30

ACA CCT GGA CAA TTT AAG TTG ATA GAA ACT CCC CTG AAC AGC TTC CTC 144
Thr Pro Gly Gln Phe Lys Leu Ile Glu Thr Pro Leu Asn Ser Phe Leu
35 40 45

TTG GTT TCC AAC GTG ATG CCC GAG GTC CAG CCA ATC TGC AGT GGC CGG 192
Leu Val Ser Asn Val Met Pro Glu Val Gln Pro Ile Cys Ser Gly Arg
50 55 60

CCG GC. TTG CGG CCA GAC TTT AGT AAT CTC CAC TTG CCT AGA CTG GAG 240
Pro Ala Leu Arg Pro Asp Phe Ser Asn Leu His Leu Pro Arg Leu Glu
65 70 75 80

AAG CTC CAG AGA GTC CTC GGG CAG GGT TTC GGG GCG GCG GGT GAG GAA 288
Lys Leu Gln Arg Val Leu Gly Gln Gly Phe Gly Ala Ala Gly Glu Glu
85 90 95

... GCA CTG GAC CCG TCT CAC GTA GAA ACA CAC GAA AAG GGC CAG GTG 336
Ile Ala Leu Asp Pro Ser His Val Glu Thr His Glu Lys Gly Gln Val
100 105 110

... ... ... CAC TAT GCT ACC GAG GAG TGG ACG TGG GCT TTG ACT CTG 384
Phe Tyr Asn His Tyr Ala Thr Glu Glu Trp Thr Trp Ala Leu Thr Leu
115 120 125

AAT AAG GAT GCG CTC CTT CGG GAG GCT GTA GAT GGC CTG TGT GAC CCC 432
Asn Lys Asp Ala Leu Leu Arg Glu Ala Val Asp Gly Leu Cys Asp Pro
130 135 140

GGA ACT TGG AAG GGT CTT CTT CCT GAC GAC CCC CTT CCG TTG CTA TGG 480
Gly Thr Trp Lys Gly Leu Leu Pro Asp Asp Pro Leu Pro Leu Leu Trp
145 150 155 160
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CTG CTG TTC 
Leu Leu Phe 



TAC AAG CAG 
Tyr Lys Gln 



ATG TAC GCT 
Met Tyr Ala 
195 

AAG TAC ACC 
Lys Tyr Thr 
210 

GCT TGC CGC 
Ala Cys Arg 
225 

CAG ATG AAA 
Gln Met Lys 



CTC TTG TGT 
Leu Leu Cys 



AAC GGA CCC GCC 
Asn Gly Pro Ala 
165 

CAC TGC GGT TAC 
His Cys Gly Tyr 
180 

CCC AAA CGG GAT 
Pro Lys Arg Asp 



ACC 
Thr 



ATA 
Ile 



CTA 
Leu 
305 

GAG 
Glu 



CAC GTG 
His Val 
275 

ACA GGC 
Thr Gly 
290 

ATC ATC 
Ile Ile 



AAT GGA 

Asn Gly 



AAG TTT 
Lys Phe 



CCG CCA 
Pro Pro 



ATC ATA 
Ile Ile 
245 

CAC ATA 
His Ile 
260 

GGT GGA 
Gly Gly 



CTA 
Leu 



Phe 
230 

GAT 
Asp 



GA5 
Asp 



TAT CAG 
Tyr Gln 



ATC CTA 
Ile Leu 



AAT GTT CAG ACC 
Asn Val Gln Thr 
295 

CCA TCG TAT GAC 
Pro Ser Tyr Asp 
310 

CTC AAC CAA CTC 
Leu Asn Gln Leu 
325 



TGT CGG GCC 
Cys Arg Ala 
170 

CCG GTG CTA 
Pro Val Leu 



TCG TTC GTT 
Ser Phe Val 



TTT TCC GGG 
Phe Ser Gly 
220 

CGG ATA CAA 
Arg Ile Gln 
235 

ACT TAC ATT 
Thr Tyr Ile 

250 

AGC ATA ATT 
Ser Ile Ile 



3T GGA AAA 

Ser Gly Lys 



TGT CCA ACT 
Cys Pro Thr 
300 

GCG ATC ATC 
Ala Ile Ile 
315 



GAC TGT TGC CTG 
Asp Cys Cys Leu 
175 

CTT CCA GGT CAC 
Leu Pro Gly His 
190 

AAT CAT GCC CTG 
Asn His Ala Leu 
205 

ACA TGG GCG GCG 
Thr Trp Ala Ala 



AGG GTA GTG 
Arg Val Val 



TCC CAC 
Ser His 



GCG 
Ala 



GGG 
Gly 
285 

ACG 
Thr 



GGT 
Gly 
270 

ACC 
Thr 



GGC 

Gly 



ACC 
Thr 
255 

CAG 
Gln 



CAG 
Gln 



GAC 
Asp 



AGT 
Ser 

240 

TGC 
Cys 



GGG 
Gly 



TAT 
Tyr 



TAT 

Tyr 



ACC ATG ATC AAG 
Thr Met Ile Lys 
320 



TAA 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



984 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Phe Ala Leu Ser Ser Leu Val Ser Glu Gly Asp Pro Glu Val Thr 
1 5 10 15 

Ser Arg Tyr Val Lys Gly Val Gln Leu Ala Leu Asp Leu Ser Glu Asn 
20 25 30 



Thr Pro Gly Gln Phe Lys Leu Ile Glu Thr Pro Leu Asn Ser Phe Leu 
35 40 45 

Leu Val Ser Asn Val Met Pro Glu Val Gln Pro Ile Cys Ser Gly Arg 
50 55 60 

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Pro Ala Leu Arg Pro Asp Phe Ser Asn Leu His Leu Pro Arg Leu Glu 
65 70 75 80 

Lys Leu Gln Arg Val Leu Gly Gln Gly Phe Gly Ala Ala Gly Glu Glu 
85 90 95 

Ile Ala Leu Asp Pro Ser His Val Glu Thr His Glu Lys Gly Gln Val 
100 105 110 

Phe Tyr Asn His Tyr Ala Thr Glu Glu Trp Thr Trp Ala Leu Thr Leu 
115 120 125 

Asn Lys Asp Ala Leu Leu Arg Glu Ala Val Asp Gly Leu Cys Asp Pro 
130 135 140 

Gly Thr Trp Lys Gly Leu Leu Pro Asp Asp Pro Leu Pro Leu Leu Trp 
145 150 155 160 

Leu Leu Phe Asn Gly Pro Ala Ser Phe Cys Arg Ala Asp Cys Cys Leu 
165 170 175 

Tyr Lys Gln His Cys Gly Tyr Pro Gly Pro Val Leu Leu Pro Gly His 
180 185 190 

Met Tyr Ala Pro Lys Arg Asp Leu Leu Ser Phe Val Asn His Ala Leu 
195 200 205 

Lys Tyr Thr Lys Phe Leu Tyr Gly Asp Phe Ser Gly Thr Trp Ala Ala 
210 215 220 

Ala Cys Arg Pro Pro Phe Ala Thr Ser Arg Ile Gln Arg Val Val Ser 
225 230 235 240 

Gln Met Lys Ile Ile Asp Ala Ser Asp Thr Tyr Ile Ser His Thr Cys 
245 250 255 

Leu Leu Cys His Ile Tyr Gln Gln Asn Ser Ile Ile Ala Gly Gln Gly 
260 265 270 

Thr His Val Gly Gly Ile Leu Leu Leu Ser Gly Lys Gly Thr Gln Tyr 
275 280 285 

Ile Thr Gly Asn Val Gln Thr Gln Arg Cys Pro Thr Thr Gly Asp Tyr 
290 295 300 

Leu Ile Ile Pro Ser Tyr Asp Ile Pro Ala Ile Ile Thr Met Ile Lys 
305 310 315 320 

Glu Asn Gly Leu Asn Gln Leu 
325 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 




GGATCCCTCT GACAACCTTC AGATAA^kAAA CGTATATGCC CCCTTTTTTC AGTGGGACAG 60 

CAACACCCAG CTAGCAGTGC TACCCCCfeTT TTTTAGCCGA AAGGATTCCA CCATTGTGCT 120 

CGAATCCAAC GGATTTGACC CCGTGTTCCC CATGGTCGTG CCGCAGCAAC TGGGGCACGC 180 

TATTCTGCAG CAGCTGTTGG TGTACCACAT CTACTCCAAA ATATCGGCCG GGGCCCCGGA 240 

TGATGTAAAT ATGGCGGAAC TTGATCTATA TACCACCAAT GTGTCATTTA TGGGGCGCAC 300 

ATATCGTCTG GACGTAGACA ACACGGATCC 330 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

GGATCCGCTG GCAGGTGGGC GCGCACCTCG TCGGGT«GCT TGGAGACAAA CAGCTCCAGG 60 

CCAGTCCGCG CCGTAGCGCC TGCAGGTGCC TCACCACCfeG GGCCGGGTCA TGCGATCTGT 120 

TTAGTCCGGA GAAGATAGGG CCCTTGGGAA GCCGCTGA^C CAGCTCCAGG GTCTCCAAGA 180 

TGCGCACCGG TTGTCGGAGC TGTCGCGATA GAGGTTAGGG TAGGTGTCCG GTCCGTCCGT 240 

GGGCTCAAAC CTGCCCAGAC ACACCACTGT CTGCTGGGGG ATCATCCTTC TCAGGGAGAT 300 

GCATTCTTTG GAAGTAGTGG TAGAGATGGA GCAGACTGCC AGGGCGTTGC AGGAGTGGTG 360 

GCGATGGTGC GCACCGTTTT TAAGAAACCC CCCAGGGTGG G($ACTCCCGC TCCCTGCAGC 420 

ATCTCGGCCT GCTGTACGTC CTTGGCGAAT ATGCGACGAA AT^GGCTGTG CGCACGGGGT 480 

CCCAGGGCCG GTCCGGTGGC ATACAGGCCG GTGAGGGCCC CCT&GGTCTG TCCGCCTGGA 540 

AACAGGGTGC TGTGAAACAA CAGGTTGCAA GGCCGCGAAT ACCCCTCTGC ACGCTGCTGT 600 

GGACGTGGGT GTATGCTCCG TGGATCC 627 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 




AGCCGAAAGG ATTCCACCAT TGTGCTCGAA TCCAACGGAT TTGACCCCGT GTTCCCCATG 60 

GTCGTGCCGC AGCAACTGGG GCACGCTATT CTGCAGCAGC TGTTGGTGTA CCACATCTAC 120 

TCCAAAATAT CGGCCGGGGC CCCGGATGAT GTAAATATGG CGGAACTTGA TCTATATACC 180 

ACCAATGTGT CATTTATGGG GCGCACATAT CGTCTGGACG TAGACAACAC GGA 233 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 



GAAATTACCC ACGAGATCGC TTCCCTGCAC ACCGCACTTG GCTACTCATC AGTCATCGCC 60 

CCGGCCCACG TGGCCGCCAT AACTACAGAC atgggaSgtac ATTGTCAGGA CCTCTTTATG 120 

ATTTTCCCAG GGGACGCGTA TCAGGACCGC CAGCTGCATG ACTATATCAA AATGAAAGCG 180 

GGCGTGCAAA CCGGCTCACC GGGAAACAGA ATGGATCACG TGGGATACAC TGCTGGGGTT 240 

CCTCGCTGCG AGAACCTGCC CGGTTTGAGT CATGGTCAGC TGGCAACCTG CGAGATAATT 300 

CCCACGCCGG TCACATCTGA CGTTGCCT 328 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
AACACGTCAT GTGCAGGAGT GACATTGTGC CGCGGAGAAA CTCAGAICGC ATCCCGTAAC 60 
CACACTGAGT GGGAAAATCT GCTGGCTATG TTTTCTGTGA TTATCTaVgC CTTAGATCAC 120 
AACTGTCACC CG 132 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
AGCCGAAAGG ATTCCACCAT TCCGTGTTGT CTACGTCCAG 40 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GAAATTACCC ACGAGATCGC AGGCAACGTC AGATGT 38 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
AACACGTCAT GTGCAGGAGT GACCGGGTGA CAGTTGTGAT CTAGG 46 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
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245 



A 

GGG T 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AGTTGCAAAC CAGACCTCAG 20 



